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CALCULATION OF REGIONAL DEPOSITION OF AEROSOL 
IN THE RESPIRATORY TRACT 


BERNARD ALTSHULER 
INSTITUTE OF INDUSTRIAL MEDICINE 
NEW YORK UNIVERSITY-BELLEVUE MEDICAL CENTER 


The removal of air-borne particles in the respiratory tract is treated to 
enable regional deposition to be inferred from measurement of expired 
aerosol as well as predicted from theory of the primary removal processes. 
The analysis uses the analogy of a continuous tubular filter-bed and in- 
cludes consideration of respiraiory pauses and the mechanical mixing of 
gas flow. Derived equations relate regional deposition, distribution of 
aerosol in the expired air, and efficiency of removal at different depths 
in the respiratory tract. 


Introduction. Predictive calculations of aerosol deposition in 
the respiratory tract were first made by W. Findeisen (1935). He 
treated the lungs as a branching system of discrete passageways 
and applied fundamental principles of particle motion in air to get 
expressions for the removal of aerosol in individual passageways. 
These calculations were later modified and extended by H. D. 
Landahl (1950) to apply to conditions of experiments in which 
aerosol was measured in four equal fractions of expired air(Landahl, 
Tracewell, and Lassen, 1951, 1952). The validity of the calcula- 
tions was checked by the degree of agreement between the pre- 
dicted and measured amounts of aerosol in the fractions of expired 
air. However, where significant differences were observed be- 
tween predicted and measured values of expired aerosol, it was 
not practical to use the measured values to modify the values of 
the physical parameters assumed for the prediction and to adjust 
the values of regional aerosol depositions. 


257 


258 BERNARD ALTSHULER 


Measurements of aerosol in fractionated expired air were used 
by J. H. Brown, K. M. Cook, F. G. Ney, and T. Hatch (1950) to 
estimate aerosol deposition in the upper and lower respiratory 
tracts. Using a model of aerosol removal with just two sections, 
one section representing removal of aerosol by the upper tract, and 
the other removal by the lower, the differentiation of deposition in 
the upper tract (respiratory dead space) from deposition in the 
lower tract (alveolar region) was made from simultaneous measure- 
ments of carbon dioxide in the fractions of expired air. 

In this paper a model of aerosol removal by the respiratory tract 
is introduced which uses the analogy of a continuous filter-bed. 
The treatment relates regional deposition with local removal char- 
acteristics and the distribution of aerosol in the expired air. The 
distribution of aerosol in the expired air can be determined ex- 
perimentally with instruments that measure flow and light scattered 
from aerosols (Altshuler, Yarmus, Palmes, and Nelson, 1957). 
With suitable conditions, the treatment can be applied to check 
theory with experiment by comparing their results for local re- 
moval characteristics and to obtain regional deposition both ex- 
perimentally from expired aerosol and theoretically from local re- 
moval characteristics predicted from basic physical principles. An 
accounting for the mechanical mixing of gas volume during flow is 
also included. 

The respiratory tract is assumed to act as a single tubular filter- 
bed through which air flows in and out, in cycles. The respiratory 
cycle may contain periods of no flow between those of inspiratory 
and expiratory flow. During an entire flow period, each element of 
the filter-bed is assumed to remove a constant fraction of the par- 
ticles in the aerosol that passes through it. This assumption is 
made plausible by considering the respiratory pattern to be ap- 
proximated by one in which flow is constant during periods of in- 
spiration and expiration. During a period of constant flow, all 
particles that pass through a particular region are expected to 
have equal probability of being deposited there because the inertial 
effect, which depends on velocity, does not change and because 
the time of passage remains the same for evaluating the effects of 
gravity and Brownian motion. 

The tubular filter-bed is identified with the branching passage- 
ways by taking regions of the same branching order to be equiva- 
lent, summing equivalent volumes, and identifying these sums with 
volume sections of the filter-bed in the order of the branching. 
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Two modes of dealing with the mixing of volume elements of the 
respiratory gas during flow have been employed in this paper. Part 
I assumes there is no mixing. Part II treats a particular type of 
mixing. The mixing is taken to be a mechanical axial process 
which exchanges aerosol between neighboring sections along the 
tube and distributes aerosol uniformly in the cross section. 


I. Deposition Without Axial Mixing 


Filter-Bed Characteristics. The quantities which characterize 
the tubular filter-bed representing the respiratory tract are: 

2, lung depth; it is the volume between the entrance of the re- 
spiratory tract and the lung site at this depth. 

f,(z) or f, (2), inspiratory or expiratory removal efficiency; during 
inspiration (or expiration) /,(x) dw (or f,(x)dz) is the probability 
that a particle reaching a lung element of volume dz at depth z is 
removed at that place. 

h,(x) or h,(x), interior or exterior removal efficiency; during the 
respiratory pause following inspiration (or expiration), A,(«) (or 
h,(x)) is the probability that a particle in a lung element at depth 
xz is removed there. 

F(a) or F,(x), inspiratory or expiratory penetration; during in- 
spiration (or expiration) F(z) (or F',(2)) is the probability that a 
particle can be carried from entrance to lung depth 2 (or from lung 
depth z to entrance) without being removed. 

P(x), combined penetration; it is the probability that a particle 
can be carried from entrance down to depth x, remain suspended 
during the respiratory pause, and carried back out again without 
being removed. 

These removal efficiencies are functions of lung depth and can 
be estimated from basic physical principles as the combined proba- 
bility of removal due to Brownian motion, sedimentation, and im- 
paction. Local removal expressions for these three processes are 
given by Findeisen (1935) and Landahl (1950) or may be derived 
from more refined considerations (e.g., Gormley and Kennedy, 
1949). The relationships between the removal efficiencies and the 
penetration functions are derived by considering two neighboring 
sites at depths z and z+dz. The definitions imply F,(« + dz) = 
F(a) (1 - f,(2) dz] which leads to the relations 


1 dF (2) 
F(z) dz 


L 


f,(2) = - Preece, (1) 
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Similarly, one obtains 

xz dF, (2) 
Fi(z) dea 
According to the definitions, a particle which 1) penetrated to a 
depth « during inspiration, 2) remained suspended during the in- 


spiratory pause, and 3) reached the mouth during expiration had 
the combined probability 


P(#) =[1-A,(@)1F;,(2) F, (@) (3) 


E (a) =a ae ; F, (a) = sedan fe (x)dx : (2) 


of not being removed. 


Concentration and Deposition. Let v, denote tidal volume and 
let the concentration of the inspired aerosol be the unit of aerosol 
concentration. Assume volume elements of tidal air are not mixed 
axially so that they move in and out preserving their sequential 
order. This means for the remainder of Part I that the penetration 
functions represent aerosol concentrations, Thus, the concentra- 
tion in the tidal air at depth ¢ is F;(z) during inspiration and is 
[1 -A,(x)] F,(«) = P(2)/F,(#) at the start of expiration. P(#)is 
the aerosol concentration leaving the mouth after the expiration of 
a volume equal to 2 and is measurable. During a respiratory cycle 
the amount of aerosol which penetrates deeper than z is(v, — 2) F;(z). 
Since the amount of aerosol which is both expired and had pene- 


trated deeper than z is / ; P(#)dz, the amount of aerosol which 


x 


leaves the distal part of the respiratory tract of greater depth than 


v 
zis F (#)* | *P (2) dz. The amount of aerosol deposited deeper 


x 
than z is the difference in these two expressions for the aerosol 
that enters and leaves the lower tract; expressed as a fraction of 
the total amount inspired and denoted by L (2), it is 


L (w) = (1 - #/v,) (F(x) -— P, (2)/F, (2)], (4) 
where 


P=(o-ay! [ 'P(@)de (5) 


x 
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is the average concentration of expired aerosol that had penetrated 
deeper than x. Alternatively, using the expression for F’, (x) ob- 
tained from equation (3) in equation (4), the lower deposit is 


L (a) = (1-2/v,)F,(#) {1 - [1 -A(@)1P, (@)/P(@)h. (6) 


The amount of deposition in any region of the respiratory tract 
bounded by depths z, and x,, with x, <@,, is given by the differ- 
ence, L (z,) —L (z,). 

The special case where 


F;(@) = F, (2) = F (a) (7a) 


1 


seems appropriate for the smaller particle sizes where impaction 
plays a minor role and the inspiratory and expiratory flow patterns 
are alike. (The impaction process of removal is more efficient 
during inspiration because diameters of the passageways decrease 
with increasing depth; this asymmetry is counteracted in some de- 
gree by the fact that time for expiration is usually greater than 
time for inspiration.) Here, P(x) =[1- h,(x)] F (x)? and substitut- 
ing for F’;(#) in equation (6), the lower deposition is 


L(#) = (1- 2/v,) {P(#)/[1- h(a)Ft —[1-A,(#)] P, (@)/P(@)}. (7) 
Specializing further with the additional assumption of no interior 
respiratory pause, 1.e., 
F(z) = F, (2) = F (2) and A,(x)=0, (8a) 
deposition below the depth 2 becomes 
L (e) = (1 - 2/v,) P (2) [1 - P, (@)/P (2)). (8b) 


Equation (8b) is readily comparable with the analogous expres- 
sion obtained by Brown, e¢ al. (1950) for their two-section model 
of aerosol removal. When translated into our notation, their quan- 
tity ‘‘AP,’? which they used to denote deposition occurring deeper 
than the anatomical dead space, is 


“AP” = (1-2/0) Py (all ~ Py, (2)/Py(@)1, (9) 


where 


Py(a) =a! [PCa de (10) 
0 
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is the average expired concentration of the aerosol that did not 
penetrate deeper than z. Equation (9) differs from equation (8b) in 
that P,,(#) is substituted for P(«), the concentration of expired 
aerosol which just penetrated to depth 2. Since P,,(z) is greater 
than P(a) (probably P(x) = P, (x)? is a fair approximation of their 
relationship for the more stable sizes), the model used by Brown, 
et al. gives smaller amounts of deposition in the upper tract than 
is obtained using the continuous and more accurate filter-bed model 
of this paper. 


Experimental Verification of Physical Theory. Expired con- 
centration can be measured as a continuous function of expired 
volume and its derivative can also be obtained experimentally. In 
the simplest case where the inspiratory and expiratory penetration 
functions are equal and there is no interior respiratory pause, the 
deposition expression (8b) is particularly simple, depending only 
on the measured function P(z). The removal efficiencies are also 
simple expressions in P(z), namely 
1° 3a (e)} 1 1 dP(2) 


F(z) dz 2 P(e) de ’ 
from equations (1), (2), (3), and (8a). 

The expression on the extreme right side of equation (11) is a 
significant quantity in the general case and so we introduce 
1 1 dP(z2) 

2 -P(s) ‘de 


f,(@) = f,(@) = - (11) 


(12) 


as a normalized slope of the measurable function P(z). From equa- 
tion (3) we have 


s(x) = - [f+ 1 +(1-2,)"' il (13) 
a a . dz }° 

When breathing is without respiratory pause, the normalized slope 
is the average value of the inspiratory and expiratory removal ef- 
ficiencies associated with the same volume. The normalized slope 
has the significance of being the experimental quantity which is 
most closely related to the local dynamics of particle removal. As 
the closest link between experiment and physical theory, it is use- 


ful for determining the physical parameters governing local parti- 
cle removal. 
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Ul. Deposition With Axial Mixing 


Mixing. Mixing is a significant factor in the pulmonary handling 
of aerosols. Data obtained in aerosol clearance experiments 
(Altshuler, Palmes, Yarmus, Nelson, 1959), in which clean air was 
inhaled after the breathing of aerosol, implies that, if the aerosol 
is of the most stable size, as much as half of the aerosol deposited 
in a particular breath of aerosol may have been inhaled in previous 
breaths. Little is known about mixing during flow in the respira- 
tory tract. To treat this case, we make certain assumptions that 
stem from our experience with flow in tubes. 

The mixing action of the flow through the tubular filter-bed is 
assumed to be uniform throughout the tube and given by a single 
function M(v,z) which is defined as a step-function response in 
the following manner. Start with a jump in aerosol concentration 
from zero to one at any point in the tube, with uniform concentra- 
tions of zero and one in the tube on either side of this point of 
discontinuity. A flow of volume v is imposed in the direction of 
zero concentration. The mixing function M(v,z) is defined to be 
the concentration that would exist at the site down the tube sepa- 
rated from the original site of discontinuity by the volume a, if 
there were no removal action by the tube. It is assumed to be in- 
dependent of the flow pattern. The mixing function of choice at 
the present time is 


_ p (w= kx)/(1—k)% >k 
M(o,2) = {5 ‘i oe (14) 


where the parameter k, A <k <1, is an index of the degree of mix- 
ing. (The value & = 1 is associated with no mixing and & = > with 
laminar flow; k=0.7 is a reasonable estimate.) The partial de- 
tivatives M,(v,7) and —M,(v,z) are weighting functions giving 
the contributions per unit volume to the composition at separation 
depth 2, of an element of flow volume and of an element of tube 
volume, respectively, after the flow volume v. 


Distribution of Aerosol Concentration with a Single Breath of 
Aerosol. Let v, denote the inspiratory tidal volume. Assume the 
removal characteristics of the filter-bed are those given in Part I 
and the mixing characteristics of the flow are given by a mixing 
function M (v, 2) as defined above. 
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The distribution of the inspired air along the tube at end of in- 
spiration is effectively M(v,,2), considering just the mechanical 
mixing of the flow; combining this mixing action with the inspira- 
tory penetration function F(x) gives the distribution of aerosol 
along the tube at the end of inspiration to be F(z) M(v,,z). During 
the interior respiratory pause this concentration is reduced by the 
holding factor 1-/,(x) to give the distribution [1—A,(x)] F; (7) M(,, 2) 
along the tube at the beginning of expiration. After expiring vol- 
ume v,, the air at the mouth is the sum of elements of air which 
come from various depths in the tube. The tube element dz at 
depth xz at the beginning of expiration effectively makes the con- 
tribution —- M, (v,,2)dz to the composition of the air at the mouth 
after expiratory volume v,; its initial concentration is reduced by 
the expiratory penetration factor F’,(x); and thus it makes the con- 
tribution F’, (z)[1 - A,(z)] F; (x) M(v,, 2) [- M, (v,, 2) dz] to the mouth 
concentration. The mouth concentration, denoted by C(v,), is the 
total contribution obtained by integration: 


ve /k 
0(»,) = ={ P(x)M(v,,2)M,(v,,2)de, v, $v,3 (15a) 
10) 


v,/k 
C(v,) = zi P(a)M(v,,2)M,(v,,2)de,  v,2,, (15b) 
0 


where we have introduced the combined penetration function P (z) 
from equation (3). 

In this development, the central problems involved in evaluating 
the pulmonary removal of aerosols are expressed in terms of the 
penetration functions and the mixing functions which determine the 
measurable concentration function C(v,). Because integration is 
a smoothing operation, the concentration measurement lacks sensi- 
tivity for evaluating the penetration and mixing characteristics. 
Nevertheless, a useful approach is made by introducing a power 
series representation for P (2): 


P(a) = : aa”. (16) 


n=0 
By substitution in equation (15a), 


co 


C(v,)=)° a, 8A, (,), (17a) 


n=0 
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where the functions 


v./k 
A (v,) = =f (a/v,)"M(v,, 2) M(v,, 2) dx (17b) 
) 


express the effect of mixing in modifying the removal action given 
by the combined penetration function. Using equation (14) for 
M (v, 2), equation (17b) becomes 


A(v,)= ko {Be S33 (e ere =! ee plete et) 
uA v; nF we 
where 
k’ =k/(1 -k) (19) 


is an alternate mixing parameter and the functions B(k’) are de- 
fined recursively by 


By(k’) =13 


B ,(k’) | ae ar es (20) 


’ 


B i(k’) =(n-1) 'HM-B_ i(k), n>1. 


For 4 =1(k’ = ~) representing no mixing, equation (15a) implies 

C(v,) =P(v,) as in Part I. With an idealized aerosol for which 

there is no removal action, i.e., P(x) = 1, a, = 0 for n2 1, and 
C(v,) = A,(0,) =1- (0, + -) 10,6 "Pe, oS v,. (21) 


When v,/v, is small enough, the second term in the brace of 
equation (18) becomes negligible so that A,(v,) is essentially in- 
dependent of v,, i.e., 


A,(u,)=k "Bi (k’). (22) 
Substituting this value of A, in equation (17a), 


C(v,) ve ak "BAK ) v2. (23) 


Approximating the expired concentration by the polynomial 
N 
C(v,) = D b ve, (24) 


n=0 
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the polynomial approximation 


N 
P (a) = »y [d_&"/B (ke lz" (25) 


n=Q 


of the combined penetration is obtained for the smaller values ofz. 
This approximation of the combined penetration may then be re 
lated to removal efficiencies in the upper respiratory tract by the 
expressions developed in Part I. 


Regional Deposition. Consider the steady-state breathing of 
aerosol of unit concentration. Let R(«#) denote the concentration 
of aerosol at depth 2 remaining in the respiratory tract at the start 
of inspiration; v, denotes tidal volume, and C(v,) the concentra- 
tion of expired aerosol as it depends on the expiratory volume 2, 
during expiration. The problem to be solved is to distinguish be- 
tween depositions above and below a fixed lung depth in the case 
when the upper tract is completely washed out by the tidal flow. 
This assumption of complete wash-out should be appropriate when 
the depth of the upper tract is small enough. In this case, as shown 
in the last section, removal characteristics of the upper tract can 
be approximated from the expired concentration function C (v,). 

Average expired concentration is 


oot | ‘C(v,) de, . (26) 
0 


Total deposition, expressed as a fraction of the inhaled aerosol is 
S=1-C,=U(e)+ L(2), (27) 


where (x) is upper deposition, proximal to the dividing depth z, 
and J) («) is lower deposition, distal to z. 

The following statements about the removal of aerosol particles 
in the upper tract follow from the assumption that the air in the 
upper tract is completely displaced by the tidal flow. The influ- 
ence of the mixing of the flow is confined to its effects on thecon- 
centrations C(v,) and R (2). 

The amount of aerosol entering the lower tract during inspiration 
is 

x 
2) F,(2) + [ ROFL" F(a) dy 
0 


(the probability that a particle is not removed in going from y to z 
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is F;()/F (y) by equation (1)). That part of the total expired aero- 


sol v,C, which did not enter the lower tract is xP, (x) -/ P(y)dy. 
0 


Thus the remainder v,C, — xP, (x) had been in the lower tract and, 
because of attenuation, had come from the amount [yC, - #P,(x)]/F, (2) 


x 
which left the lower tract during expiration. The aerosol ii R (y) dy 
0 
that was left suspended in the upper tract at the end of the respira- 


tory cycle came from the aerosol [ R(y)(1-h,(y)\* dy which 
) 
was left there at the end of expiration and which in turn came from 


x 
the aerosol [ R(y)(1-h, (yl 'F.(y)F, (2) 'dy which left the 
0 


lower tract during expiration. Thus the total aerosol which left 
the lower tract during expiration is 


[v,C, - Py (2)IF, (2) + [ "R(y) (1 - A, (QF, (y) F, (#)* dy. 


0 


The difference between that which enters the lower tract and that 
which leaves, expressed as a fraction of the tidal volume aerosol, 
is the lower deposition 


L(a) = (1 -2/v,) F,(e) - [1-5 - Py (2) (2/2, 1F, (2) * + 
(28) 


mt [Ie @) Fy) - (1 -A YF) FP (2) Ry) ay, 
0 


where we have used C, =1- o. Then by equations (27) and (3), 
upper deposition is 


(a) = F,(#) 141 - F,(2)F, («) - 1 - F, (2) 5 - [Py (2) - 


F(e) F, ()l(a/2,) + 
> (29) 
oy} [ [1 — A,(y)I7 1 - h(E Py) - 
0 


[1 - A, (a) P (23k (y) Fy) * dy 
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Introducing 


R* (2) = 07 IP, (2) - Prt fo (1-4, (yl - A (yl - 
; (30) 
h,(2)1P(y) - P(@SR(Y)F,(y) ay, 


and using equation (3) again, equation (29) can be put into the form 
(a) =(1-P)(F,/P){1-(1-F,)(1-F,F.) 19 - 
(Py - P)(1 - P)1(1 - R*) (@/2,) - (31) 
h(i - P) 1 -(1-F,)(1-F,F.) 19 - Py (#/)]). 


This last equation is suitable for examining the effects of the 
various quantities and for estimating upper deposition from the 
measurement of expired aerosol. From the measurement of expired 
aerosol, the total deposition 3 is determined directly and the pene- 
trations P(x) and P(x) by the method of the last section. The 
quantity R*(x) need not_be known accurately; it does not differ 
much from the average concentration of aerosol in the upper re- 
spiratory tract at the start of inspiration and can be estimated from 
experiments with forced expirations. The quantities F;/P and 
(=-E HEF. )~! can be estimated satisfactorily in most situa- 
tions from theoretical values for local removal. The influence of 
the respiratory pause is expressed through the parameter h,(1 — P)™ i 
which needs also to be determined from theoretical estimates of 
local removal. 

The expression for upper deposition has been formed in equa- 
tion (31) so that the necessary theoretical estimates are used to 
determine relative quantities, whereas the absolute quantities (P, 
Re 3, R*) are measurable. The roles of the various factors on 
the right side of equation (31) are directly recognizable when the 
volume of the upper tract and the respiratory pause are small. The 
first factor (1 - P) is dominant because most of the inhaled aero- 
sol passes through the upper tract in both directions. The second 
factor F;/P reflects the fact that more aerosol goes through the 
upper tract during inspiration than during expiration. Inside the 
braces, it is seen that a greater amount of aerosol left in the upper 
tract (K*) allocates more deposition to the upper tract and that a 


greater respiratory pause, h,(1- P)~', allocates less deposition to 
the upper tract. 
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Symbols 


a,—coefficients of power series expansion of P(x), equation (16) 

A,,(v,)—defined by equation (17b) 

b,—coefficients of a polynomial approximation of C(v,), equation 
(24) 

B_(k’)—defined in equation (20) 

C (v,)—expired concentration measured at the mouth after volume 
v, has been expired 

C,—average concentration in total expiration 

f,(z) or 7, («)—During inspiration (or expiration) /,(x)dz (or f, (x)dz) 
is the probability that a particle reaching a lung element of vol- 
ume dz at depth z is removed at that place. 

F(z) or F, (z)—During inspiration (or expiration) F;(x) (or F, (#)) 
is the probability that a particle can be carried from entrance to 
depth z (or from depth z to entrance) without being removed. 

F (x)—denotes F(x) = F, (x) when these are equal 

h,(w) or h,(z)—During the respiratory pause following inspiration 
(or expiration) 4,(x) (or h,(«)) is the probability that a particle 
in lung element at depth z is removed there. 

k, k’—mixing parameters introduced in equations (14), (19) 

L(x), &(x)—fraction of total deposition that is deposited distal to 
depth z in the cases where there is no mixing and where there is 
mixing, respectively 

M (v,x)—mixing function defined in the text before equation (14) 
is introduced, with partial derivatives M,(v,z) and M, (v, x) 

P(«)—probability that a particle can be carried down to depth z, 
remain suspended during the pause and carried out again without 
being removed 

P, (w)—average value of P (y) with ag an 


P y (#)—average value of P (y) with y S$ 
R i) -oaconceuieation remaining at see a at start of inspiration 
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s(z)—normalized slope of penetration function P(«), defined by 


equation (12) 
3j—total deposition expressed as a fraction of the amount of aero- 


sol inhaled 

U(x), U(«)—fraction of total deposition deposited proximal to 
depth # in the cases where there is no mixing and where there 
is mixing, respectively 

v—variable denoting volume of respiratory gas 

v,— expiratory volume 

v,—inspiratory tidal volume 

v,—tidal volume 

z—variable denoting volume of lung depth 
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An integrative mathematical approach to endocrinological homeostasis 
is presented. By means of it, three homeostatic systems of increasing 
complexity are analyzed. The two-variable system described embraces 
the thyroid hormone and the thyrotropic hormone; the three-variable sys- 
tem, calcium, phosphate, and the parathyroid hormone; and the four- 
variable system, sodium, water, the antidiuretic hormone, and aldo- 
sterone. The Laplace transforms of the variables are found. The han- 
dling of these transforms utilizes well-known techniques. 


Homeostasis in a number of endocrinological systems is essen- 
tial for mammalian life (Cannon, 1939). The organism must limit 
the variation of many endocrinologically controlled substances and 
physical characteristics to a narrow range. Dysfunction, disease, 
and death result from inadequate control of these variables (Selye, 
1959). Research has led to identification of the controlled quan- 
tities and has indicated their physiological limits, as well as the 
effects of deviation from these limits. Knowledge in this field is, 
however, still in its introductory stages. Fragmentary information 
exists with regard to the nature of the homeostatic processes— 
the physiological standards, the means of measurement, the trans- 
mission of signals to the control organs, the specific operation of 
these organs. One of the obvious reasons for this lack of knowl- 
edge is the present inadequacy of technical methods. An equally 
important factor may be the absence of an integrative viewpoint. 
The present work, based upon mathematics, is a contribution to- 


ward such an integrative approach. 
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Theoretical Results 


I. The system of the thyroid and anterior pituitary gland. Three 
different patterns of endocrinological homeostasis are presented 
in this paper. While the differential equations which result can be 
solved by direct methods, use of the Laplace transform facilitates 
their handling considerably. The distribution of each substance 
throughout the physiological space it occupies is assumed to be 
instantaneous, and minor fluctuations in its concentration from 
point to point in space are neglected. In the first system—the 
thyroid-anterior pituitary relationship—the controlled variable is 
the thyroid hormone (Brown-Grant, 1957). The total amount of thy- 
roid hormone outside the thyroid is represented by 2. Thyroid hor- 
mone bound to serum protein is apparently part of the active pool 
(Albright, Larson, and Deiss, 1955). The physiological volumes 
V, and V, in which the thyroid hormone and TSH, respectively, are 
chiefly dissolved are considered to be essentially constant during 
a given short period of time. It will be assumed that the rate of 
secretion of thyroid stimulating hormone from the anterior pituitary 
gland is directly proportional to the difference between @ per unit 
volume and a physiologically standard value of z per unit volume, 
namely x,, while the rate of destruction of TSH is proportional to 
the amount of it present outside the pituitary gland per unit vol- 
ume. These assumptions appear reasonable but cannot be exactly 
evaluated at present because of the inadequacy of experimental 


data. Thus, 
aP()_ fe Rese 
dt i 1 V Ti vy = a ce re = (1) 


x 


where ¢ = time, P = total amount of TSH in the body outside the 
pituitary, &, and &, are proportionality constants, and k, is a 
constant adjusting for stray secretion when «/V.=2,. Since TSH 
responds relatively slowly to changes in a2 (Bottari, 1957), k, must 
be small; on the other hand, experimental work indicates that ks 
must be considerably larger (Levey and Solomon, 1957). The re- 
cently postulated effect of the hypothalamus in this system is ex- 
pressible by changes in the value of k, (Brown-Grant, 1957). The 
Laplace transform of equation (1) (Gardner and Barnes, 1942; 
Wiener, 1948) is 

a 


k k k 
oP) = tO) eee v. x(s) - r P(s) 
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and 
a x(s) 
SPO). ke, +k, V 


x : (2) 
k 
8 ( =| 
Vp 


Now, it is consistent with available data to assume that the 
rate of secretion of thyroid hormone is proportional to the rate at 
which TSH passes through the thyroid gland (Soderberg, 1958), 
while the rate of destruction of the thyroid hormone is proportional 
to the amount of it per unit volume outside the thyroid. If the rate 
of blood flow through the thyroid, F, is considered to vary over a 
slight relative range and to be essentially constant during a given 
short time interval, the rate of flow of TSH through the thyroid is 


P(3) = 


(P-F)/V,. The value of F is affected by many factors, one of 
which is sympathetic tone (Soderberg, 1958). In the absence of 
absorption of exogenous thyroid hormone from the gastrointestinal 
tract, there results 

d z(t) em 


s P +k, 
dt Vy 


h@ 
6 
LE 3 
: (3) 
x 
where &, and &, are constants of proportionality and &, corrects 


for the slight output, if any, when P(t) = 0. Then, 


sz(s) — 2(0) = oo P(s) + a - 52 X (9) 


x 


When the value of P(s) in equation (2) is substituted into this 
equation, the result is 


2(s) |8+ — + ———__ || = 


(4) 
k k,F |\sP(0)+k,#,+k 
X(0) + + —* ea steteotat Ba 
8 Vz ( ee 
s {s+ — 
Vp 
The solution for z(s) has the form, 
As*+Bs+C 
a(s) = (5) 


s(s2+Ds+E)’ 
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where the constants A, B, C, and D are, respectively, equal to 


A = «(0); 
OV‘ k,F P(O 
B= oC beet ae eeu! 
V> Vp 
ow Babs, Bibgeih | bahyh 
Vp Vp Vp 
k k 
D=—% 4 8, 
Vp Wey 
op bake + Pik, 
ie Vp 


Solution of this equation requires, first, determination of the 
roots of the characteristic equation, and then application of simple, 
straightforward principles for handling the Laplace transform. The 
requirement that the characteristic equation have roots at the ori- 
gin or in the left half of the complex s-plane for a finite non-oscil- 
latory solution leads to the usual restrictions upon the permissible 
values of D and E. In addition, EF cannot be equal to zero, since, 
then, s? would appear in the denominator and the increasing term 
constant -¢ in the final solution. At t=, z(¢)-= C/E, since the 
coefficient of the term 1/s in the expansion by partial fractions of 
z(s) is C/E. It is interesting to note that division of this final 
value by V_, yields a result different from ,. When 2(s) in equa- 
tion (5) is substituted in equation (2), the solution for P(s) is ob- 
tained. Its characteristic equation is very similar to that of 2(s), 
but is of one higher degree. 


Il. The system of calcium, inorganic phosphate, and the para- 
thyroid glands. The second system which will be considered is 
the one involving calcium, phosphate, and the parathyroid glands. 
Here, we shall let 2 = total amount of diffusible calcium dissolved 
in volume V, of extracellular fluid outside of bone, and y = total 
amount of inorganic phosphate dissolved in a similar volume V_. 
Protein-bound calcium has distinctly different properties from 
physiologically active diffusible calcium (Salvesen and Linder, 
1924). Since the deviation of calcium per unit volume from a phys- 
iologically standard value determines the secretion of parathyroid 
hormone (Howard, 1956) and since the destruction of this hormone 
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is, reasonably, proportional to its level per unit volume, there re- 
sults an expression similar to equation (2), namely, 


Pista) 
Bites SP 4(0) + kye, +k, - ye (7) 


k k 
Vp Vp 


Here P, is the total amount of parathyroid hormone dissolved in 
volume ae and the constants retain the same significance as for 
the thyroid system but have the values of the parathyroid system. 

The parathyroid hormone apparently has two actions—mobiliza- 
tion of calcium and phosphate from bone (Barnicot, 1948) and con- 
trol of phosphate excretion by action upon the kidney (Ellsworth 
and Howard, 1934). If it is assumed that the net rate of mobiliza- 
tion of calcium from bone is proportional to the value of P, per 
unit volume and that the rate of loss of calcium by excretion in 
urine and feces is proportional to the level of calcium per unit 
volume, then, when there is no absorption of exogenous calcium 
from the gastrointestinal tract, there results 


dz _ kPa k ke @ 


5 
‘ae V. 


(8) 


Again, since net mobilization of phosphate from bone is assumed 
to be proportional to P,/V,, while its renal excretion is propor- 
tional to the same term and, in addition, to the level of phosphate 
per unit volume (Howard, Hopkins, and Connor, 1953), the ex- 
pression for dy/dt becomes 

a Rid (9) 


did 12 y 


The term kioy/V, is necessary to prevent the occurrence of Se 

the denominator of the solution for y(s). Since y cannot be age 

than zero, dy/dt must be equal to or greater than zero when y = 0, 

k,—-k 
7 


and, hence, : P, +, must be equal to or greater than 


zero when y= 0, or k, must be greater than kg. Equivalent con- 
ditions exist for equations (1), (3), (7), (8), (11), and (14). 

The solution for x(s) results from substitution of equation (7) 
into the transform of equation (8), and has the same form as equa- 
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tions (5) and (6), namely, 
As?+Bs+C 


A)" SPs Des B) 


with the expressions for the coefficients the same as previously 
given. With substitution of the equations for P(s) and 2(s), y(s) 
becomes 


if k,-k, 8P,(9)+#,2,+ k 
y(s) = ——— |y(0)+ + *._4 __~ + - 
K 19 Vp ( Se 
8+ — S| Susie 
VS Vp 
(Kin — Kg) k, ge: As*+Bs+0 kg 
V k s(s?+Ds+E) 8s 
P. 


This equation for y(s) has the form 


Gst+Hs? + Ms?+Ns+Q 
y (s) = ——__—___________________, (10) 


k k 
s (a4 = + 1S aS 
Ks i“ 


where G, H, M, N, and Q are constants the values of which are 
easily determinable, and, for brevity, will be omitted here. The 


Q 


(as 


It is interesting to note that the characteristic equation of y(s) 
has considerable similarity to that of z(s). This means that at 
least some of the terms in y(¢) have the same general form as the 
terms in 2(¢). In addition, the restrictions upon D and E apply 


equally to a(s) and y(s). The solution for P, (s) is equally straight- 
forward. 


final value for y(¢) at t= ~ is 


Ill. The sodium-water system. The final system under considera- 
tion is the sodium-water one. We shall let 2 = total amount of so- 
dium ion in the extracellular fluid and V = volume of the extracel- 
lular fluid, Situated in the hypothalamic region, apparently, is a 
tissue which is sensitive to the intravascular volume and controls 
the adrenal secretion of aldosterone in some unknown way, perhaps 
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neurogenically (Bartter, et al., 1958), If, over a short period of 
time, the intravascular volume is considered to be a constant frac- 
tion of V, it appears reasonable to assume that the output of aldo- 
sterone is proportional to V — V,, where V, = value of V equiva- 
lent to the physiologically standard value of the intravascular 
volume. If the destruction of aldosterone is proportional to its 
level per unit volume, the result is 


ws kV = V5) +k Ms 7 ( 
est at lees elie ai ico 11) 
dt pe 

where Z = total amount of aldosterone outside the adrenal gland 
dissolved in volume V,. Then, 


8Z(0) +k, Vi +k, eV a8) 


k k i 
V, V, 


Now, the reabsorption of sodium in the kidneys is assumed to be 
proportional to the amount of aldosterone per unit volume (Muller, 
Mach, and Naegeli, 1955), while the excretion of sodium in urine, 
sweat, and feces is dependent upon its concentration. Since the 
amount of variation in V is ordinarily small compared with the 
actual volume over a short period of time, the value of V in which 
the sodium is dissolved may, for mathematical simplicity, be ap- 
proximated by a constant value V.. Then, 


dz Fe is 

hie AG Saas. 28 + kg. (13) 

dt eS Ve 
Since, in the absence of absorption from the gastrointestinal tract, 
the total amount of sodium in the extracellular fluid, with no source 
of replenishment (Strauss, 1957), can only decrease, k,Z/Vz must 


be less than (k, 2/V,)+,. In addition, since the value of @ can- 
not be less than zero, the boundary condition which must be satis- 


Z(s) = (12) 


V 
fied is that dx/dt must equal zero and, hence, 7 must equal = (k.) 
5 


when 2 = 0. 

The secretion of antidiuretic hormone from the posterior pituitary 
gland is dependent upon the concentration of sodium, which is the 
chief physiological determinant of extracellular osmotic pressure 
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(Verney, 1948). As a result, it appears reasonable to assume that 


es (Fe -28) yh the (14) 
W 

where W = total amount of antidiuretic hormone in volume Vy of 
the extracellular fluid and 2, is a physiologically standard value 
of sodium per unit volume. When 2/V, exceeds z,, the secretion 
of antidiuretic hormone increases, since the resultant retention of 
water by the kidney causes a homeostatic decrease in the concen- 
tration of sodium, and vice versa. 

The rate of movement of water into the extracellular compartment 
from the intracellular compartment is assumed to be proportional to 
(«/V,) - #,, where x, is the concentration of sodium which would 
yield the value of the osmotic pressure of the intracellular fluid 
during the given short period of time. The values of z, and z, 
have no fixed relationship to each other. The equation for dV /dt 


is 
ui G [ k ( a (15) 
—=k,,. (—- 2} - + + ~ : 
dt Ea 2 it 12 13 Ee 


In this equation, &,, represents evaporation of water through the 
skin and lungs and &,, the obligatory excretion of water through 
the kidneys, and both of these quantities are assumed to be con- 
stant over a given short period of time. The obligatory excretion 
of water is dependent less upon sodium excretion than upon such 
diverse factors as glomerular filtration rate and serum albumin 
(Best and Taylor, 1950). The term &,,W/V, represents the reab- 
sorption of water by the kidney due toantidiuretic hormone and must 
be less than k,,- 

Combination of the Laplace transforms of equations (11) and 
(13) yields 


k 
a(s) p+ pt] + veo a = 2(0) + 
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Similar combination of the Laplace transforms of equations (14) 
and (15) gives 


— x(s) ee kek, + V(s) [s] = 
a (or) 
W 
reer cut i q (17) 


Rig sW(0)-k,@, +k, 


i aciads (s+) 
Vy 
Solution of these two simultaneous linear equations yields 2 (8) 
and V(s). The solutions for W(s) and Z(s) can then be easily 


found. For brevity, these solutions will be omitted. Procedures 
for final solution in terms of the time are straightforward. 


Discussion 


The body has more than one mechanism for homeostatic control 
in endocrinological systems. The three closed-cycle systems dis- 
cussed in this paper illustrate various degrees of complexity. Thus, 
in the thyroid-pituitary axis, there are only two variables—the thy- 
roid hormone and TSH, Three variables are present in the calcium- 
phosphate-parathyroid system. Finally, in the sodium-water sys- 
tem, four variables are present, the two in addition to sodium and 
water being antidiuretic hormone and aldosterone. The degree of 
the Laplace transforms increases in agreement with the increased 
complexity of the systems. 

The mechanisms discussed in this paper do not by any means 
exhaust the methods available for physiological control. They rep- 
resent the behavior of the body independent of external factors. 
The organism as a whole, however, behaves with respect to its ex- 
ternal environment in such a way as to discourage excessive fluc- 
tuation of controlled quantities. Thus, when sodium is high and 
body water low, thirst supervenes, and the body takes in water 
from external sources. Similarly, mammalian organisms drink se- 
lectively from sources containing ions in which they are deficient 
(Richter, 1943). But these behavioral patterns are not readily ex- 
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pressible mathematically, since they are not strictly predictable in 
occurrence, are ordinarily affected by a wide variety of external, as 
well as internal, factors, and, mathematically speaking, represent 
discontinuous rather than continuous variables. Thus, they would 
introduce great mathematical complications with little real gain in 
understanding of basic relationships. 

In most cases, the assumptions made in this work appear to be fea- 
sible in the light of present knowledge. The use of proportionality 
rather than more complex relationships between controlled and con- 
trolling variables prevents the mathematical principles from being 
submerged by impossibly complicated mathematical expressions. 
A reasonable guess is that the actual relationships are more com- 
plicated but that proportionality will turn out to be a close ap- 
proximation, at least over limited ranges. The effects of lags in 
response and inactive zones have been omitted for similar reasons. 

The Laplacian equations which have been found are basically 
simple in principle, but because of the relatively large number of 
parameters and the problems attendant upon determination of the 
roots of the characteristic equations, the actual manipulations may 
be burdensome. Calculating machines would be of assistance. It 
is interesting to note that the parameters cannot exist in complete 
independence of each other. Certain constraining relationships 
must exist between them for compliance with physiological require- 
ments and in order that the roots of the characteristic equation will 
always lie at the origin or in the left half of the complex plane. In 
this way, functions which are physiologically implausible, oscillate 
indefinitely, or increase in value as ¢ increases are avoided. The 
inverse transforms in terms of ¢ consist of the sum of constant 
terms and decaying exponential terms, some of which are oscilla- 
tory. The behavior of the time variables is thus quite complex. 
The parameters must be constant during the period under analysis. 
Hach time they change in value, a new group of numerical calcula- 
tions with different initial conditions and different parameter values 
is necessary. Theoretically, a very large set of calculations might 
adequately express the behavior of a system in which the parameters 
change in discontinuous steps of very brief duration. However, if 
the variation of the parameters is continuous, a completely new and 
much more difficult problem exists. 

With increasing refinement of knowledge about endocrinological 
homeostasis, the type of mathematical formulation presented in 
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this paper will become essential. Such a mathematical approach 
will be of great usefulness not only in integration and simplifica- 
tion of masses of data, but also as a means of prediction of new 
directions of investigation. 
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The case of alternating stimulus patterns of moderately high intensity 
is considered in terms of the model previously discussed (Landahl, 1957, 
Bull, Math, Biophysics, 19, 157-62). If both of the alternating patterns 
have the same light-dark ratio, then the relation between the period of 
the longer and the period of the shorter pattern at the critical flicker 
frequency is independent of the light-dark ratio and is given by a dimen- 
sionless expression which is roughly in agreement with data in which 
the light-dark ratio is one (C. R. Brown and D. M. Forsyth, 1959, Science, 
129, 390-91). 


It is the purpose of this note to give an expression for the criti- 
cal flicker frequency for alternating patterns in which each pattern 
consists of a constant intensity which is maintained for a certain 
fraction of its period, the intensity being the same for both pat- 
terns. We shall use the two-factor theory model discussed previ- 
ously (Landahl, 1957). Let A and B be the periods of the first 
and second patterns, respectively, and let A 2B. Let the light be 
on for an interval rA, off for an interval (1 —7)A, then on for an 
interval 7’B and off for an interval (1 -7’)B. Consider the steady 
state. Then, if the intensity is moderately high, the frequencies 
being considered will be high, so that the slower, inhibitory fac- 
tor, j, can be assumed to be approximately constant over each 
period. The excitatory factor « is determined from 


de 
wae? 2s aE aL 
oF a(p-e), (1) 
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where a is a rate constant and ¢ depends upon the intensity (loc. 
cit., equation 3). Let «, be the value of « at the time that the light 
comes on at the beginning of the A cycle. The value of « at a time 
rA later can be easily obtained from (1) since ¢ is a constant and 


hence 


é, =" ear is o(1 -* lon, : (2) 


«, being the value of « at the end of the light period of the A in- 
terval as well as that of the beginning of the dark period of the A 
interval. During the dark phase, of duration (1 -7)A, the light is 
off so @ = 0 in (1). Hence at the end of the A interval, we have 
€=€, given by 

6 = Pe ee (3) 


At the end of the light period of the B interval, which has a dura- 
tion r’B, « will have a value e, given by an expression similar to 


(2), 


é aes +h (IS aH Ph Oye (4) 


At the end of the cycle, (1-7’)B seconds later, «, being ina 
steady state, must return to its initial value «,. Hence, since the 
light is off during this last dark period, we have 
Ey = €3 Pa Selene (5) 
Substituting «, from (2) into (3) and the resulting expression for 
€, into (4) etc., we obtain an expression from which we can solve 
for ¢,: 
3578 “ g sUinnAse® z g stash to Pet Ss 
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Now the value of j will be given by 
1A +7°B 7 


j being proportional to the fraction of the time that the light is on. 


The largest value of « — j, the net excitation, will be at the end 
of one of the ‘‘on’’ periods. If 


j= 


A?(1-r)r>B7r(i-r’, 


then « —j7 will be greatest at the end of the first light period hav- 
ing the value, obtained from (2), (6), and (7), 
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Fe te ee er 8 AB nt) 8? a 
2(A + B) 
If A*(1-1r)r<Br’(1-9’), then (c- Dm iS given by (8) with A 
replaced by B and 7’ by r and vice versa. 

By setting (« — j), equal to a threshold 4, we then have an ex- 
pression relating the intensity, through ¢, to the periods 4 and B 
for various values of the ratios r and r’.. For the case in which 
7 =r’, expression (8) represents the maximum net excitation for 
A > B, and it can be written as 

A? +(2B—A,)A-B(A, +B) =0, fess SAE ee (9) 
ar(1-r)¢ 
Note that A,, which depends on the ratio r and the intensity through 
¢, is the period at critical flicker when B = 0, i.e., A, is the re- 
ciprocal of the c.f.f. for the given r and intensity. Since expres- 
sion (9) is homogeneous, A and B can be replaced by A/A, and 
B/A, with the result that A/A, is determined as a function of 
B/A, without any arbitrary parameters. This is true according to 
the model as long asr =r’. From (9) we then find 
SFE epee eae ONE TIE A2B (10) 
a ee Oe eet 
If A < B, (10) can be used with A and B interchanged. Expression 
(10) is represented graphically by the solid curve in Figure 1. 
Data by Brown and Forsyth (1959) are included in the figure. These 
data are for the case in which r =r’ = 1/2. It can be seen that the 
principal discrepancy is that the experiment suggests that a slight 
change from equal periods (A= B) produces an increased c.f.f., 
whereas the calculated curve does not show this effect. 

The discrepancy in this region where A is nearly equal to B can 
be accounted for by considering that the threshold / is normally 
distributed with a standard deviation o. Suppose that fusion fails 
when, on the average, there is one response every unit of time, 
such as the average interval between eye movements. This proba- 
bility is proportional to the number of ‘‘on’’ occurrences times the 
probability that each responds. The latter probability for the ‘‘on 
periods of the A component is given 3 

ioe mal 


p (a= * less) sie p(é)dé. (11) 
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FIGURE 1. The relation between the periods A and B at the critical 
flicker frequency. The points use data by C. R. Brown and D. M. Forsyth 
(1959). The curves represent calculated values. 


But since (e, — 4)/o is a fairly large negative value, we may write 
as a rough approximation in the range from 2 to 50 per cent (e, < A) 


2 o 


The weighted probability of response for both A and B is then 
proportional to the frequency of A and B, 1/(A +8), and to the 
sum of the probabilities of response to A and B. That is, 


1 ‘ =s ( =ltet)| 1 L 
Poles eee Se et Ye 0, nae 
An B [ ( = +P - ye, 7 -—A), (18) 


where «? is the value of €, =€, when A =B =A). Introducing (12) 
into (13) we find 


(12) 


oO 


eV MEM EDO eV Bes En i/o_ A+B 


Ay 


But €, —j is given from (8), is given by the same expression by 
interchanging A and B, and e; is obtained from (8) by setting A = 


aes 0 Hence we find the following approximation for r =r’ = 
L/2i 


(14) 
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_ kL Ag(A+B)-24 Bl 


i k(A-B =k(A- A+B 
P (AFB) [ek CA~B) , pT RAT BY _ 


, (15) 
0 

where k = /2a¢/80. The dotted curve in Figure 1 shows the ef- 

fect of threshold variability when p =.05 in (15). It can be seen 

that the main effect is to make the derivative of the curve con- 

tinuous at A=B=A),. If the approximation (12) is not used, the 

effect is somewhat more localized. 

There remains then the fact that the points are below the curve 
by an appreciable amount. The assumption that at << 1 in obtain- 
ing (10) does not appreciably restrict the result obtained. Since 
the simple model gives the curve without parameters it would be 
of particular interest to know to what extent the intensity influ- 
ences the shape of the curve; that is, A/A, versus B/A, for vari- 
ous intensities. For the simple model, this curve is independent 
of intensity. If the deviation between the values from the simple 
model and the observed values are largely related to the frequency 
1/(A + B) but not to intensity, then one might anticipate that such 
a curve reflects the pseudo-resonance of the center registering 
flicker which has been assumed to have only a threshold. If such 
is the case, then by introducing the neural element with explicit 
time constants a’ and 5’, it might be possible to account for the 
observed results in terms of a model made up of two neural ele- 
ments in series. 
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A notion of quasi-ergodicity is defined in free monoids, generalizing a 
notion introduced in a previous paper (Rosen, 1959, Bull. Math. Bio- 
physics, 21, 71-95). It is shown that under certain conditions the alge- 
braic properties of quasi-ergodicity are very similar to those derived for 
the more specialized concept in Joc. cit. If it is assumed that the DNA- 
protein coding processes in nature are of a quasi-ergodic nature, then a 
condition is specified under which only a finite number of different DNA- 
protein codes are possible, and an upper bound for the total number of 
different quasi-ergodic codes is obtained. 


In a previous work (Rosen, 1959) we outlined an algebraic ap- 
proach to the DNA-protein coding problem, based on the theory of 
free monoids. In particular, we introduced an algebraic analog of 
the statistical concept of ergodicity, and discussed some possible 
applications of this concept to the coding problem. However, the 
definition of ergodicity which we proposed in Joc. cit. is a rather 
strong one, and can only be approximated by a statistical process. 
It is the purpose of the present note to show that all of our con- 
clusions concerning ergodicity remain valid under a weakening of 
our hypotheses which is designed to bring our algebraic discus- 
sion into closer accord with statistical notions. We shall assume 
that the reader is familiar with the notation and terminology of our 
previous work, and we shall use them freely below without further 
comment. The reader will note, however, that the bold-face italic 
M of Joc. cit. has been replaced by the character ‘i throughout. 

Definition: Let n rational numbers 7,, p,,---, P, be given, such 


n 
that 0 <p, <1 for each z and >i =1. Let ¢ be a positive num- 
i=4 
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ber. A word w of the free monoid Jl, will be called quasz-ergodic 
of order ¢ if 


for each 2. 

It is seen that quasi-ergodicity coincides with the notion or 
ergodicity defined in Joc. cit. when « = 0. We shall now proceed to 
investigate some of the algebraic properties of quasi-ergodicity. 

Theorem 1. The set of words of the free monoid ‘lt, which are 
quasi-ergodic of order e with respect to the rational numbers 7,, 
..», p, form a submonoid of Il,. 

Proof. We must show that if w,, w, are quasi-ergodic, then 
their product w,w, is also. By definition, we have 


A; (4) A;(%2) 
Sat ‘| €,y Govan ee 
or 
|A(w,) - PA,)| <eA(w,), |, (9) ~ p;A(w,)| < €A(w,). 


Adding these inequalities, and remembering that A is a homomor- 
phism, we find 


| (wy) — B,A (m4) | +14; (wa) — p:A(wg)| < €A(w yw). 
Now we always have | a+ 5] <|a|+| |; hence 


1d; (w,) — pr (w,) +A; (Wy) - p;A(w,)| < €A(w Wy), 


or 
| A; (w wy) — p;A(w,w.)|< €d(w,w,), 
or 
MaliSoe bie) I< 
A (w ,W.) Pi : 


which was to be proved. Since the empty word is by definition 
considered to be quasi-ergodic, the theorem is completely demon- 
strated. 

This result generalizes Theorem 4 of loc. cit. Let us agree to 
denote the monoid the existence of which we have just proved by 


E(p;, ©); we shall refer to such a monoid as a quasi-ergodic sub- 
monoid of I, . 
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The task of generalizing Theorem 5 of loc. cit. leads to compli- 
cations. For example, it is readily seen that if we put «> 1, then 
E(p,;, «) = N,. We can obtain a still sharper statement: 

Theorem 2. E(p,, «) = Mt, if and only if the condition 


«> max; (p,, (1 - p;)) (1) 
holds. 
Proof. We first prove the ‘‘if’’ portion of the theorem. It is 


clearly sufficient to show that (1) implies that each generator a, of 
it, is in E(p,, «). If a, is such a generator, then 


Perlciae | | 4 |< 1 
d(a,) P; UE: = max ( ne 
and 
rA.(a-) 
AF Bat eee a Ais ; 
Mista Pil 4, = | pjlS max (p,) <6, 


so that this portion of the theorem is proved. Conversely, suppose 
that there is an index z, 1% n, such that ¢«< max(p,). Then 
corresponding to that index, we have 


A; (@;) 
| d(a,) a D,| ini = lp,|> €, 
from which it follows that a,¢E(p,, e) and hence that E(p,, e) 4 Il,. 
An identical argument applies if for some 7, e < max (1-—p;). Thus 
the theorem is proved. 

Theorem 2 implies that for quasi-ergodic monoids E(p,, e) for 
which e satisfies (1), an analog of Theorem 5 of Joc. cit. is false 
(since Mt, is finitely generated). However, if we restrict ourselves 
to monoids E(p,, «) which do not satisfy (1), then we can prove 

Theorem 3. A quasi-ergodic submonoid E(p;, «) of I, such that 
E(P;5 6) # lis free on a countably infinite set of generators. 

Proof. Suppose the theorem false for some quasi-ergodic monoid 
E(p,;, 6). Then we can find a finite set of words w,, Wy,---+, 
w, in E(p,, e) such that any word weE (p,, e) is a product of the w,. 
Let us make the following definition: a word w of a monoid will be 
called a terminal segment of a word w’ if there exists a word W 
such that mw = w’ (we allow @ to be the empty word). 

Since E(p,, «) + ,, there is a generator of I, (4; 59 say) such 
that a; fEDis e). We now prove 
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Lemma 3.1. There exists a generator Wi 5 of E(p;, e) such that 
no other generator of E(p,, ¢) is a terminal segment of W; "2, for 
any n> 0. 

Proof. Since the number of generators of E'(p,, e) is finite, there 
must exist a generator (which we may call w,, relabelling the w, if 
necessary) such that W 14; , is not a generator of E(p,, e). Consider 
the words w,"@, . If none of these admit a generator of E(p,, e) 
as terminal segment, we are done; otherwise, there is a generator 
(which we may call w,, relabelling if necessary) such that w, is a 
terminal segment of w,"a@, for some n (and hence for all larger n). 
Consider the set of words W"a; The word w, cannot be a termi- 
nal segment of w,"4;, for any n, since these words all end in one 
more @, than w,. Hence if w,"a@; admits a terminal segment, 
which is a generator of E'(p,, e), it must be different from w,, w,. 
We may (relabelling again if necessary) call this generator w,. 
We now form the set "a; and notice that by the same argument 
we have just used, neither w, nor w, can be a terminal segment of 
W,"4, for any n. Hence a generator of E(p;, «) which can serve 
as a terminal segment of some word w,"4;, is distinct from w,, 
W, W,; we may call it w,. Proceeding again in the same manner, 
we find that we shall ultimately contradict the finiteness of the 
set of generators of E(p,, ¢); from this contradiction the lemma 
follows. 

We now return to the proof of the main theorem. We form the set 
of words fw, "4,} using the generator w, of E(p,, ¢) the exist- 
ence and properties of which are given by Lemma 3.1. It is clear 
that 


Aw. "a; nr.(w; 

pescatton hate _DAMMIg)  _ Big (2) 
(W jo Gio iFto nd(w;,)+1 

rip(Wj,"4in) NAi(wj,)+1 

| "o_o" _ 9, | =| —2 2 - |, (3) 


d (Wj 9" Gio) ae nA (wy) +1 


and that, for sufficiently large n, we shall ultimately have (2) and 
(3) < e, by virtue of the fact that w,;,» being a generator of E(p,, e), 
enjoys this property. But none of the words w; "ao admit a gen- 
erator of E(p,, «) as terminal segment. Thus, a fortiori, none of 
these words can be expressed as a product of generators of E(p,, 
€), although we have just ‘shown that for sufficiently large n these 
words are elements of E(p,, «). Hence the assumption that E(p jy €) 
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has a finite set of generators is untenable, and the theorem is 
proved. 

To complete our program, it is necessary to show that Theorem 
6 of loc. cit., which asserts that ergodicity is preserved under 
monomorphisms, can be generalized to (proper) quasi-ergodic sub- 
monoids of Jl,_. The proof of this assertion involves the use of a 
number of simple inequalities, proofs of which will be left to the 
reader. We state the required inequalities below: 


A) If 0<a< band c > 0, then 5 < 5 


Gai 
B)If0<a< band0<c< (a+ 5), then 
la—cl<{b—cl. 


Theorem 4. Let f: 2, — NM, be a monomorphism, and let E(p,, 
«) be a proper quasi-ergodic submonoid of l,. Then f(E(p,, e)) is 
a quasi-ergodic submonoid of ,- 

Proof. We shall show that f(E(p;, ©)) is quasi-ergodic with re- 
spect to the numbers 


nh nh 
Lar Pi D8; 
1 
: n 


5S sarc is See st hE Hews 
pava* 8)? ; Pelee ee UPg 

which were computed in the course of the proof of Theorem 6 of 

loc. cit. We must show (using the notation of loc. cit.) that 


A (w) saree pA 
X(@) — p,| < « implies | 1. (f (00) sls, 


where 1 Sin, j =1, 2, and 7 will be calculated below. By the 
methods that we used in the discussion of the proof of Theorem 6 
of loc. cit, we find that 


7A (10) 
PUTO) eae: aa 


u(f(w)) Vir, + 8,)A,(w) 


1 


2 (4) 


From the quasi-ergodicity of w, we find (since it is no loss of gen- 
erality to remove the sign of absolute value), 


Aw) < (e+ p;)A (w). (5) 
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Using a slightly generalized form of inequality (A), we find from 
(4) and (5) 


cy rd (w) ied; + e)A(w)] 

nh . n ; cs (6) 
Do; + 8;)A,(w) Dies S;) [(p, + )A()] 

i i 

But clearly 

Yirl(p, + )r(w)] Dirdp; + )rA(w)] 

: = ges eT 
di (7, + s,)l(p, + )rA@)I Di (r, + 8;)p;,AW) 

i i 


If now in the inequality (B) above we put 
a = left-hand side of (6), 
6 = right-hand side of (7), 


C=T715 


then it is not hard to verify that the condition for the validity of 
this inequality is fulfilled; the inequality itself becomes 


| (7 (w)) = | | Darl; + A (wv) Lari 
L (f (w)) "a av Se ee Par as 
Lars + $i): (w) LAME SDP; 


1 


But the right-hand side of this inequality is equal to 


Dt 


i at 
@| et = 
Yitrs + S;)P; 


which is in fact <e since the quantity in the brackets in (8) is less 
than 1. An identical argument gives 
n 
8 
| Ha(f(w)) a 
u(f(w)) ‘ 
Lar + 8;)D; 


~1,|<e we 
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If we now put 7 = max(7,, No)> we find that the theorem is com- 
pletely proved. 

Thus, on the basis of Theorems 1-4 above, we have seen that 
the conclusions of loc. cit. are unchanged if we replace the word 
ergodic with the term quasi-ergodic throughout. At the same time, 
we have brought this concept into closer accord with its traditional 
definition. 

We shall now investigate some algebraic notions connected with 
quasi-ergodicity, which do not arise in the strictly ergodic theory 
of Joc. cit. These notions may throw some light on our previous 
discussion as to whether or not there could exist more than one 
DNA-protein code in nature. We recall that our previous discus- 
sion strongly suggested that if more than one distinct code ex- 
isted, then the (strongly ergodic) submonoids of Sl, corresponding 
to these different codes were disjoint. 

It is very easy to verify that two strictly ergodic submonoids 
E pss E ¢a;) of a free monoid are either disjoint or identical, so 
that requiring the DNA-protein code to be strongly ergodic is quite 
consistent with the existence of more than one, or even infinitely 
many, DNA-protein codes in nature. The situation is different in 
the case of quasi-ergodicity. First, we observe 

Lemma 5.1. The intersection of two submonoids of a monoid is 
again a submonoid. 

Proof. Left to the reader. 

We now prove 

Theorem 5. Let E(p;, €,),E (4G; €2) be quasi-ergodic submonoids 
of a free monoid 2. Then the intersection of these submonoids is 
non-trivial (i.e., different from {e}) if and only if | Pim q,| < max 


tes Eo) 
Proof. Put « = max(e,, €,). Then 
X ,(w) A (w) 
E (Djs €,) NE (Qj €) & Iw! | 5 Yes pil<e, Bea q;| <«}. 


That is, p, and q, are both within a distance « of 2 ,(w)/A (w) for 
each w in the intersection. Hence they must be within a distance 
2 from each other. Thus the theorem follows. 

We may remark that the intersection of two quasi-ergodic monoids 
is again quasi-ergodic (we shall not give the proof of this result 
here; the proof is not difficult and may be left to the reader). Hence 
by Theorem 3 the intersection of quasi-ergodic submonoids is sub- 
stantial (i.e., countably generated) whenever it is non-trivial. 
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To obtain a further result, under the hypothesis that the DNA- 
protein coding process is quasi-ergodic, let us examine the fine 
structure of the quasi-ergodic submonoids of J, which are suitable 
for this process. If we denote the generators of n, by By Gq Bay 
a, and write a, = adenine, @, = guanine, @,= cytosine, @, = thy- 
mine, then the Watson-Crick model implies that p, = Py, Pg = Pz 


4 
and (since a p; = 1) that p, -F(1 -2p,). (It may be remarked 


that these relations can be used in conjunction with equations (4) 
of loc. cit. to give further conditions which must hold in naturally 
occurring (strictly) ergodic codes; note that the symbols p,, ..., 
p, above are denoted by 7,,..., 7, in loc. cit.) Hence the knowl- 
edge of one of the p, (p,, Say) completely determines the others, 
and we may restrict our attention to p, alone. 

Let us now suppose that there is a greatest lower bound to the 
numbers ¢ which occur in natural DNA-protein codes; we denote 
this greatest lower bound by e,. That is, if E(p;, €) is a sub- 
monoid of ‘lt, corresponding to a naturally occurring DNA-protein 
code, then «2«. Thus, if E(p,, ©) and E(q;, «) are any two dis- 
tinct such monoids, Theorem 5 implies that we must have 
| Pi - q,| >€)- But p,, g, are restricted to lie in the open interval 
(0, 1); in such a finite interval there can only exist a finite number 
of points which are mutually separated by a distance greater than 
a finite length e,. Thus the choosing of an ¢,, however small, 
restricts the number of acceptable quasi-ergodic submonoids of 
t,, and hence (under our assumption that the DNA-protein coding 
process is quasi-ergodic) the total number of different possible 
DNA-protein codes becomes finite, and in fact must clearly be 
less than or equal to 1/e,. 

The existence of a number ¢, with the property described above 
means that the natural DNA-protein coding processes do not ap- 
proach strict ergodicity arbitrarily closely. This may in turn im- 
ply that coding occurs in nature between DNA molecules and poly- 
peptides the lengths of which are so small that deviations from 
strict ergodicity cannot be neglected. 

We shall conclude this note with a discussion of the circum- 
stances under which all polypeptides may be coded by natural 
codes. In loc. cit. we showed that if there was only one DNA- 
protein code in nature, and if this code was of a strictly ergodic 
character, then not all polypeptides could be coded. It is easy to 
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see that, since a free monoid Sl, cannot be the (set-theoretical) 
union of any number of strictly ergodic submonoids, even the ex- 
istence of a countable number of strictly ergodic codes in nature 
is not sufficient to code all polypeptides. With quasi-ergodicity, 
however, the situation is different. We can show, just as in loc. 
cit., that a single quasi-ergodic DNA-protein code cannot code all 
polypeptides. But it is possible for Sl, to be the union of even a 
finite number of quasi-ergodic submonoids. However, we can prove 

Theorem 6. Let ip), fins p SH}, ip? pares pt, <n 


n 
ines i. pert be r sets of rational numbers such that 2 Pe = 
im 


1 and 0<p,“*?<1. Then there exists a (largest) number 7 > 0 


such that « < 7 implies that L) Ep, << i.. 
k= 1 

Proof. All that is necessary is to select 7 so small that some 
generator a,, say, of Jl, is not an element of any of the quasi- 
ergodic submonoids E (p,“*?, 7). The techniques of Theorem 2 
show how this can be done precisely; the details are left to the 
reader. 

The result of Theorem 6 may now be applied to the case dis- 
cussed above, in which the numbers e¢ occurring in the natural 
DNA-protein codes (assumed to be quasi-ergodic) have a greatest 
lower bound «,. In this case, we have shown that the number of 
distinct codes is finite. If further each of these numbers « is also 
less than 7, where 7 is chosen as in Theorem 6, then not every 
polypeptide can be coded in nature under these circumstances. 


This work was aided by United States Public Health Service 
Grant RG-5181C. 
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Some aspects which involve the interaction of the human element and 
of the machine element in driving are discussed. As an example a sim- 
ple equation is derived for the maximum safe speed of a car on an empty 
road. The parameters of the equation are partly of physiological nature, 
partly of mechanical nature. Another example treats in a similar manner 
the problem of a car passing another car moving in the same direction. 


In recent years a number of mathematical investigations have 
been devoted to problems of automobile traffic (cf., for example, 
Beckman, McGuire, and Winston, 1956; Gerlough, 1955). Most of 
those studies deal, however, with problems of traffic in which a 
large number of vehicles is involved. Problems of waiting at in- 
tersections, of distribution of traffic flow, etc., fall in that cate- 
gory. The theory of probability is found to be a useful tool in such 
problems (Gerlough, 1955). Very little, if anything, has been done, 
to the author’s knowledge, on the problems involved in the driving 
of an isolated car on an ‘‘empty’’ highway. Such a case is of 
course an abstraction. It is, however, not infrequently fairly well 
approximated in reality. This type of problem involves the inter- 
action of the ‘‘human element’’ with the ‘‘machine element’’ and 
may lead to fruitful mathematicobiophysical studies. True enough, 
a vast literature exists on the effects of different psychological 
and physiological factors on driving. However, a systematic the- 
ory, based on biophysical considerations, seems still to be lacking. 
The purpose of the present note is to outline an approach to such 
problems. This note is only the first step in that direction. In sub- 
sequent studies the biophysical approach will be introduced more 
explicitly than here. The relation of this approach to the more devel- 
oped usual one may perhaps be similar to the relation between the 


299 


300 N. RASHEVSKY 


study of interactions of two individual molecules in thermal motion, 
and the behavior of a gas. G. F. Newell (1955) considers the cars on a 
highway as a sort of swarm of molecules. In his discussion he points 
out that the behavior of a gas in thermodynamical equilibrium does not 
depend on the exact law of interaction between the molecules. Yet 
we may remark that when we deal with only very few molecules, 
those laws of interaction do become of importance. The study of 
the behavior of a gas as a whole does in no way make superfluous 
the study of the laws of interaction of individual molecules. 

What determines the maximal safe speed on an empty road? The 
factors are partly functions of the automobile, partly of the psycho- 
physics of the driver. Consider a one-lane road of width s, and a 
car of width s, and length J,. The problem is to keep the car as 
close to the middle of the lane as possible. Actually, due to both 
the imperfectness of the road and the physiological limitations of 
the driver, the car describes a sort of zigzag line. The angle 6 of 
the line with the direction of the road is usually small. It is also 
partly determined by the conditions of the road, partly by the car, 
and partly by the driver. A bumpy road produces shocks which 
exert lateral forces on the car and tend to throw it off the proper 
direction. Hence the smoother the road, the smaller will be the 
angle 6. A loose steering mechanism will increase 6; the ability 
of the driver to make small movements of the steering wheel and to 
feel the proper ‘‘correction’’ also affects 6. Even on a perfect 
road an inexperienced driver drives in zigzags with a large @, be- 
cause of his tendency to ‘‘overcorrect’”’ slight deviations from the 
proper course. Actually, the angle 6 is not constant, but fluctuates 
around some average value. As a first approximation we shall con- 
sider this constant average value and thus schematize the actually 
irregular zigzag line by a uniform one with equal branches at equal 
angles. 

The car should not come closer to the edge of the road than a 
certain minimum distance 5, The driver turns the steering wheel 
when the front wheel is at a distance 5 from the edge (Fig. 1). 
The angle 6 being very small (of the order of magnitude of 1° or a 
fraction thereof), we may consider approximately that the car turns 
away from the edge with one of its rear wheels at a distance § 
from the edge. In other words, in this very crude treatment we neg- 
lect the short but finite process of ‘‘turning’’ at the edge of the 
lane. 
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FIGURE 1 
From Figure 1 we have 
oa s-26 
sin @= : 
ac (1) 
8 
AB = aie (2) 
2 tan @ 


Hence the distance z traveled by the car between two ‘‘turns”’ 
of the zigzag line is 
s-—-26 8 


pA DAB msi ae eg ae] 
/ sin 6 tan 6 ) 


Because of the smallness of 6, (3) may be written thus: 


S—s, —26 8 - —~25-61 
Bt ot poe Se SS (4) 


A 
0 : 7) 
If the speed of the car is v, then the time ¢ between two con- 


secutive turns is 


x 
t= — (5) 
Vv 


or, from (4): 


t= ——*—_-, (6) 


The driver must anticipate the ‘‘turn’’ and react sufficiently 
promptly to this anticipation. Hence ¢ must be sufficiently larger 
than the reaction time T of the driver for this particular stimulus. 
We thus have: 

18 ee (7) 


Hence, from (7) and (6): 
ds (3-3, —-25— Oto) 


ay (8) 


4a 
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or 

eo ea 
Gage 


v (9) 

The maximum safe speed is given by the equality sign in (9). 
Above that speed the car will jump off the road. 

Equation (9) states that the maximum speed for a given size of 
car is the greater, the wider the road (or the narrower the car); the 
smaller the safe distance at which the car may be to the edge; the 
better the road (small 6); and the shorter the reaction time of the 
driver. Those are rather obvious conditions. But equation (9) 
gives a quantitative relation between the different quantities in- 
volved, and can in principle be verified experimentally by using 
simulated driving conditions, such as have been proposed for some 
driver’s tests. 

8,-256 


If 6 es 2 Obes 
Lo 
tive. In other words no driving is possible because a car of that 
length cannot be maneuvered. A sufficient decrease of J, will re- 
verse the sign of the above inequality and make driving possible, 
The dimensions of the car enter implicitly into condition (9) in 


another manner. Obviously we must have 


, the maximum velocity becomes zero or nega- 


z> 1; (10) 


otherwise the car cannot be maneuvered at all. Equations (4) and 
(10) give 


s-8,-25>201,. (11) 


For a fixed angle 6, and for a fixed 8, the width of the lane must 
therefore be sufficiently large, when the length and width of the 
car are prescribed. In fact we have from (11): 


8S>201,+8),+25. (12) 


Of two cars, both of the same width but of different length, the 
longer one will have to travel at a slower speed to be as safe as 
the other. This, however, may be upset by a reduction of 6 for a 
longer and heavier car. 

Expression (11) establishes the maximum speed v,, for an empty 
one-lane road. In it the parameters s, 8), and /, are purely physi- 
cal, The parameter T is a pure physiological one. The parameter 
5 is likely to be affected by both physical and physiological con- 


MATHEMATICAL ASPECTS OF AUTOMOBILE DRIVING 303 


ditions. There is no @ priori reason why we could not have § = 0. 
To what extent it is safe to approach the edge of the lane too 
closely depends on such purely physical conditions as the pos- 
sible jaggedness of the edge, height of the pavement over the 
shoulder, softness of the shoulder, etc. Those are physical fac- 
tors. But physiological factors enter also. An accidental jumping 
off the shoulder may produce a reaction of fear in the driver. It 
will require additional coordinated movements as well as additional 
rapid judgment to put the car back on the pavement. 

The parameter @ is also a composite one. On one hand @ de- 
pends on the condition of the road, as we mentioned at the be- 
ginning. On the other hand, even on a perfectly smooth and hori- 
zontal road, the car will not follow a perfectly straight line, even 
if the steering wheel is held perfectly rigid in a perfectly correct 
position. Deviations from a straight line will be due to small im- 
perfections of the car, such as unequal tires, imperfect weight dis- 
tribution, etc. Finally, even a perfect car on a perfect road will 
not follow a straight line because of unavoidable small motion im- 
parted to the steering wheel by the driver. Since whenever one or 
any two of the three above mentioned factors are imagined to be 
zero, the other still remains, we must consider 6 as the sum of 
three components, 6,, 6,, and 6,. The first is due to road condi- 
tions, the second to the conditions of the car (machine), the third 
—to the human element. 

The component 6, depends both on the dimensions (s, and J) 
of the car, as well as on its design. Other conditions being equal, 
we have 


Oxenet (6,961 5) 2 (13) 


It is this relation that may upset the conclusion that the safe 
speed of a longer car must be slower than that of a shorter car. 

The reaction time T is in general affected by various ‘‘distract- 
ing’’ stimuli, such as road signs, other cars on the road, conditions 
of the road, etc. In a subsequent paper we shall derive expres- 
sions for T in terms of those as well as of other additional factors. 

Expression (9) establishes the maximum speed v,, on an empty 
one-lane road. If the road is not empty, then other factors enter 
into the picture. On most highways traffic in one direction pro- 
ceeds in one or two, rarely three or more, lanes. Neglecting for 
the time being the problem of one car passing another, the limiting 
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factor for a non-empty road lies in the requirement of a sufficiently 
large distance between cars in the same lane to enable the rear car 
to stop in case of a sudden stop of the car in front. This situation 
has been studied by various authors (Beckman, McGuire, and Win- 
ston, 1956; Newell, 1955). We shall discuss it here very briefly. 
Let the distance between two cars along the lane be 7. Let the 
stopping distance of a car traveling at a speed v be p(v). Ap- 
proximately, p increases as the square of v. Thus, a denoting a 
constant, 


p=av?. (14) 


If the car in front stops suddenly, the driver of the car in the rear 
will apply the brakes only after a time interval T,, where T, is his 
reaction time to that particular stimulus. In general T, is different 
from tT used in equation (9). During this time the car will have 
traveled a distance T,v. Hence the ‘‘total stopping distance’’ is 


T vt av?. 
This must be less than 7. Hence 
av?+T,v-1<0. (15) 


The positive root of equation (15) gives the maximum safe speed. 
It is equal to 


Be 2 y 
- ts Te +4a 


- 2a 


(16) 


The actual requirement may be less stringent than (15), because 
the car in front will take a finite time to stop. But for practical 
purposes of safety, even the value of v given by (16) is too high. 

Here again we have both physical (a, 7) and physiological (5 
parameters, 

As another example we shall discuss the problem of one car 
overtaking another on a two-lane highway. 

Let a car move along a two-lane highway at a speed v, anda 
car just behind it at a speed » + Av, Av>O. If the distance be- 
tween the cars is 7, then the second car will overtake the first 
after a time 


<= (17) 
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In overtaking, considerations of safety require that the second 
car go into the left lane when it is at least a distance r behind the 
first car and return into the right lane when it has passed the car 
by a distance r. If s is the total width of the highway, then the 
middle line of the two cars at the moment of passing will be s/2. 
The distance r may be determined by the following considerations. 

Let us first neglect the dimensions of the cars as compared with 
the distance r. Denote by ¢, the time it takes the second car to 
reach the first, counting from the moment it starts going into the 
left lane. The second car from this moment moves at an angle 9 
to the direction of the highway. During the time ¢, it has traveled 
a distance AB (Fig. 2). During this time the first car has traveled 


A’ B= vt. (18) 


FIGURE 2 
We have 
32 
(AB)? = —< +(r+vt,)*. (19) 
This distance having been traveled at a speed v + Av, we have 
AB 
Le meen mE (20) 
v+Av 
Equations (19) and (20) give 
2 
8 
t?(v + Av)? = F +(r+vt,)*, (21) 


or 


2 
8 
[2vAv Bi Releg 2008 =r -r2=0. (22) 
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This has one positive and one negative root. The positive root is: 


gs? 447? 
ry +\f r207 + ar ae [2vAv + (Av)? ] 
E  ——— er 23 
; 2vAv + (Av)? he 


During the time ¢,, the second car travels laterally the distance 
s and must turn to the right after that, lest it soon jump off the 
left stde of the highway. This implies that 


(ie ea (24) 


where T is again the reaction time of the driver. 

It is readily seen from (23) that ¢, increases with r for a given v. 
Hence (23) and (24) impose for a given v a lower or safe limit r, 
upon r, This makes it possible to establish legal requirements 
for distances of safe passing on a highway with a given speed 
limit, 

Let us find an expression for the total time ¢, the second car will 
be partly or totally in the left lane. Considering that for safe 
values of 7, the angle @ is very small, we proceed as follows. We 
may neglect s. Then we find from either (21) or (23): 


Tr 


t,=—. 
OK 


(25) 
After passing the first car, the second must return to the right lane 
after having moved a distance r ahead of the first car. This takes 
atime ¢, again. Hence 


Qr 


a (26) 


But this holds only if the cars have no dimensions. Actually, 
at the beginning of the overtaking, the distance from the front of 
the second car to the rear of the first is r, After passing, the dis- 
tance of the rear of the second car to the front of the first car is 
7. Referring everything to the fronts of the two cars, we have, be- 
fore passing, a distance r+ /,, and after passing—the same. Dur- 


ing the time ¢,, the first car (or its front) moved a distance Vtg. 
The second car moved a distance 2(r +1,) + vé, at a speed v + AD. 
Hence 


2(r+1,) + vt, 


| gen pee RAs LAER 
z v+Av : , 
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or 


2(r+,) 
TMi Say et (28) 
Among other things equation (28) states the obvious result that 
it takes longer to overtake a large car. 
From (28) we have for the total distance which the second car 
travels in the left lane: 


2(r+1,)(v + Av) 


q=(v+Av)é, = *< 


(29) 
For safety, the second car should not collide with an oncoming 
car. If the traffic in the opposite direction is sufficiently heavy, 
so that the average distance between the cars in not too large, and 
if the distance between the cars in the left lane is 1,, and they 
travel also at the speed v, then, assuming for a moment the over- 
simplified picture of a uniform spacing of cars in the left lane, the 
greatest possible distance between the overtaking car and an on- 
coming one is /,. The relative speed of the two being 20 + Av, 
the cars will collide within the time 
l 


1 
Te GAAS (30) 


Hence, if no collision should occur, we must have 


Cet 4 (31) 
or, from (28), (30), and (31): 
mes > 2 (rit to) ‘ (32) 
2u+Av Av 


r, and Av, inequality (32) gives 


oS ee (33) 
4(27r+1,) 

The smaller Av, the smaller the safe speed v. In this respect 
powerful cars with plenty of reserve power for pick-up are of ad- 
vantage. On the other hand v decreases with increasing 7, But as 
we have seen, r has a minimal safe value r, given by (23) and (24). 
Hence the requirement (33) is to be replaced by 

(1, -27, —21,)Av 


<< ———___—__—__ (34) 
i 4(2r, +1.) 


For a given /,, 
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All the above holds only for the case of a uniform spacing of 
cars along the highway. Such a situation never occurs. Actually, 
consider that there are in the left lane N, cars per unit length of 
highway. For not too high concentrations we can neglect the di- 
mensions of the cars. We then have on the average N/ q cars in the 
left lane per stretch g, which is the distance that the overtaking 
car travels in the left lane. According to Poisson’s equation, the 
probability that in any given stretch of length qg there will be no 
cars is equal to : 


| es er 5 eee (35) 
Hence the probability that an overtaking car which travels along 


the left lane for a distance qg given by (29) will not collide with an 
oncoming car is: 


¥ No (2rtlg)(vtAv) 
Py =e Av “ (36) 
The expression 
Py a bes a (37) 


gives us the probability of head-on collision during overtaking. 
This probability increases with N, v, 7, and/,. It decreases with 
increasing Av, 


The author is indebted to Professor Herbert D. Landahl for a 
thorough discussion of the paper, and to Dr. M. Beckman for valu- 
able information regarding the literature on the subject. 

This work was aided by a grant from the Dr. Wallace C. and 
Clara A. Abbott Memorial Fund of the University of Chicago. 
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A hypothetical ‘‘self-reproducing living molecule’’ has been recently 
frequently discussed in the literature. A physical theory of such a mole- 
cule is tentatively outlined. The relational, rather than the quantitative, 
aspects of the process are emphasized. The process of synthesis of 
such a molecule from its components is considered as a transition of a 
complex system between two quantum-mechanical states. The require- 
ment that a ‘‘living molecule’’ should be able to multiply only by self- 
reproduction but not by a formation de novo leads to the conclusion that 
the above mentioned transition is a forbidden one. A constellation of 
conditions under which the forbiddenness is lifted by the presence of 
an already existing ‘‘living molecule’’ is suggested. The outlined 
mechanism leads to the conclusion that the property of self-reproduction 
does not so much reside in the self-reproducing molecule itself as in its 
relations to other molecules of a system. 


Hitherto, the systematic development of mathematical biophysics 
has been based almost exclusively on classical physics. Cells, 
though microscopic in the ordinary meaning of the word, appeared 
to be still ‘‘matter in bulk’’ from the point of view of the molecular 
or atomic level. However, numerous attempts at developing a 
mathematical theory of cell multiplication based on classical 
physics only must be considered on the whole a failure, in spite of 
the fact that the theory did lead to a correct prediction of some 
quantitative phenomena involved in cell division (Rashevsky, 
1959c). 

The need for a ‘‘molecular mathematical biophysics’’ (Rashev- 
sky, 1940) became apparent in connection with the studies of ef- 
fects of different radiations. The fact that a single a-particle, for 
example, may kill a cell which consists of some 1014 atoms and 
roughly 101° molecules indicates that a single molecule may con- 
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trol the activities of a very large number of others. An interpreta- 
tion of this fact by a sort of ‘‘chain reaction’’ was suggested by 
N. Rashevsky (1940). This suggestion was, however, still based 
on concepts of classical macrophysics. 

Further experimental developments in various fields of biology 
continued to indicate the need of molecular physics in the interpre- 
tation of different biological phenomena (Delbriick, 1946). Re- 
cently the evidence for such a need has been supplied by A. Szent- 
Gyorgyi (1957) in connection with his studies of the energetics of 
muscular contraction. The whole field of virology (Pollard, 1953; 
Luria, 1953) also definitely indicates that ‘‘life begins at the 
molecular level.’’ 

On the other hand the principle of biotopological mapping, pro- 
posed by N. Rashevsky (1954) and developed since by him and his 
coworkers (N. Rashevsky, 1956, 1958a, 1959a, b; E. Trucco, 1956, 
1957a, b; Robert Rosen, 1958a, b, 1959a, b), especially by Robert 
Rosen, also suggests that the simplest structure which still pos- 
sesses the principal relational characteristics of an organism may 
be looked for on the molecular level. The principle of biotopologi- 
cal or simply, biological, mapping brings into focus the relational 
aspects of biology, which in many instances may prove to be more 
basic than the quantitative or metric aspects, though both aspects 
must be considered for a full description of a biological system. 
We have tentatively suggested (Rashevsky, 1954) that the metric 
aspects of our universe may be reflected in the laws of physics, 
while its topological aspects may be reflected in biological phe- 
nomena. 

To the extent that we know of biological phenomena only through 
their physical manifestations, a close connection between physi- 
cal phenomena and biological phenomena must exist. Whether this 
connection amounts to a complete reducibility of biology to present 
day physics, or whether an extension of present day physics may 
be necessary to include phenomena of life, is a question which 
cannot be answered at present. The possibility of the biological 
phenomena following physical laws, and yet not being derivable 
from physics, should also not be overlooked in connection with 
similar situations in pure mathematics, as exemplified by Gédel’s 
Theorem. 

The obvious thing for a mathematical biologist to do is to try to 
investigate as many physical interpretations of basic biological 
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phenomena as possible. Success or failure will indicate which 
road to take next. In our early studies of mathematical biophysics 
we began with the study of cell multiplication as the most im- 
portant of all biological phenomena. Multiplication of an organism, 
be it a metazoan or a virus, is still the central phenomenon in 
biology. Having, however, come down to the molecular.level, we 
now must center our interest on a theoretical interpretation of 
multiplication or self-reproduction of the individual ‘‘living’’ mole- 
cule which seems to form the actual basis of life. 

We must, however, keep in mind that even a ‘‘molecular’’ ap- 
proach to biological phenomena does not necessarily imply that 
there exists such a thing as a ‘‘self-reproducing living molecule.”’ 
The process of multiplication of a biological system may be a 
characteristic of the system as a whole and not manifest itself in 
any of the components of the system when those components are 
taken alone, isolated from their systemic interrelations. Such a 
view has, for example, been taken by A. I. Oparin (1957). Oparin 
also refers to C. N. Hinshelwood’s (1947; also Dean and Hinshel- 
wood, 1955) sceptical attitude toward the theory that the gene is 
endowed with a ‘‘mystical ability to reproduce itself.”’ 

In a recent paper (Rashevsky, 1959b) we outlined a possible 
quasi-relational approach to the problem of ‘‘systemic’’ self-repro- 
duction, pointing out, however, that as long as opinions on the 
subject are divided, and evidence for either of the two views is 
inconclusive, we should not take a prejudiced point of view and 
should keep in mind other possibilities. It is in the spirit of this 
attitude that we propose to discuss here a possible approach to the 
‘‘molecular’’ theory of self-reproduction. The discussion in this 
paper seems to indicate, however, that even though such a ‘‘molec- 
ular approach’’ may be successfully developed, it is likely to 
carry within itself the elements of a ‘‘systemic’’ point of view. On 
the basis of the mechanisms of self-duplication of a molecule out- 
lined in this paper, we would expect that such self-reproducing 
molecules exhibit their property of self-reproduction only when they 
are basically parts of other, more complex, molecular systems. 
This does not prove of course that physical mechanisms may not 
be considered which would result in the self-reproduction of a 
single molecule in a different manner from that suggested here. 

In dealing with the process of self-reproduction of a single mole- 
cule we can no longer use the methods of classical physics and 
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must take recourse to quantum mechanics, even if only in its most 
general aspects. 

The relational aspects of biology show themselves on the lowest 
imaginable level of life. It is therefore interesting to note that of 
all branches of physics, quantum mechanics shows more relational 
aspects than any other, with the exception perhaps of the theory of 
crystallographic space lattices. In fact, it hardly would be an 
exaggeration to say that most of the ‘‘practical’’ general applica- 
tions of quantum mechanics are based more on its relational than 
on its metric aspects. 

The original development of quantum mechanics as based on 
Schroédinger’s equation emphasizes definitely the metric aspects. 
But those metric aspects lead to results of a much more relational 
character. The classification of atomic and molecular terms, which 
E. Wigner (1931, p. 118) so aptly calls ‘‘the zoology of terms”’’ is 
much more relational than metric. The metric aspects of a term 
would consist of a more or less complete description of the w-func- 
tion belonging to that term, together with a specification of the 
exact energy level. But such a description and specification is 
available in complete form only for the hydrogen atom. In approxi- 
mate forms such descriptions and specifications are available for 
some more complex atoms and molecules (Eyring, Walter, and Kim- 
ball, 1945; Kauzmann, 1957). But in general we can discuss the 
S, P, D,...ete. terms, or the &, Il, A,...,...terms and their mu- 
tual relations without knowing either the complete corresponding 
w-functions, or the exact energy levels. 

A step towards an almost complete ‘“‘relationalization’’ of quantum 
mechanics was made by H. Weyl (1926), when he showed that the 
relations derived from Schrédinger’s equation can be directly de- 
rived from the theory of groups without the need of solving 
Schrédinger’s equation, either exactly, or approximately (E. Wigner, 
1931; Eyring, Walter, and Kimball, 1945; L. D. Landau and E. M. 
Lifschitz, 1958). In fact we may say that the results thus obtained 
do not even presuppose the particular form of Schrédinger’s equa- 
tion for the y-functions. All we need is that the differential equa- 
tion for the y-function be an eigenvalue equation with both non- 
degenerate and degenerate eigenvalues, and that it be invariant 
under certain groups of transformation. 

Though this is undoubtedly a purely superficial analogy, yet it 
may be interesting to note that the sets of terms show some simi- 
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larity in their mutual relations to the sets which we have considered 
in topological biology. Thus all the S, P, D,... terms of an atom 
can be mapped on the corresponding terms of another. In particu- 
lar, energy levels which occur as a result of splitting of a degen- 
erate term can be mapped on the single energy level of a non- 
degenerate corresponding term in another atom. If, instead of 
representing the different energy levels of an atom by horizontal 
lines, as it is usually done, we represent them by points, and if 
we connect the representative points which correspond to levels 
between which transitions are possible by lines, we obtain a graph. 
Graphs which correspond to different hydrogen-like atoms can then 
be mapped continuously in a many-to-one manner upon the graph of 
the hydrogen atom. We may even introduce a topology into those 
sets of energy levels or terms, if we assign to each term as a 
neighborhood all those terms to which transitions from the given 
term are possible. 

While all these analogies are probably superficial, they serve to 
underscore the relational aspects of quantum mechanics. 

If we are to interpret the basic biological relations in terms of 
quantum mechanics, it is natural to expect that this interpretation 
will come from the relational rather than from the metric aspects of 
quantum mechanics. We may of course attempt to compute with 
the help of electronic computers the y-function of some highly 
complex molecule in order to see whether this molecule will pos- 
sess the property of self-reproduction. We may even succeed in 
this. It would seem, however, very unlikely that such basic re- 
lational properties as are manifested in the process of multiplica- 
tion, will depend on the detailed quantitative specifications of a 
set of w-functions. All available evidence indicates that the self- 
reproducing molecules of viruses, or the molecules of genes, are 
quantitatively very different from each other. What is invariant is 
a basic set of relations. It seems to us much more plausible that 
the basic phenomena of life, whether on the macroscopic or on the 
molecular level, are due to some basic relational aspects of phys- 
ics. It may be worth mentioning that, even in macroscopic metric 
mathematical biophysics, the relational aspects sometimes are of 
preater importance than the exact quantitative ones. This idea 
underlies the development of the approximation method introduced 
first by N. Rashevsky (1938), and developed and used later by 
H. D. Landahl (1953) and others (Rashevsky, 1959c). In the ap- 
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proximation method we are more interested in the general type of 
functional relation between variables than in the exact quantita- 
tive determination of the functions. 

The purpose of this paper is to suggest a possible quasi-rela- 
tional quantum mechanical approach to the basic problem of multi- 
plication of a hypothetical ‘‘living molecule.” 

Two important features characterize the process of reproduction. 
The new organism, be it a metazoan or a molecule of a virus, al- 
ways originates only if another organism is already present. Omnis 
vivum e vivo. The chemical components of which the organism is 
constituted may all be present in the system. Yet they will not 
combine in the absence of another organism. Referring specifi- 
cally to a living molecule A, the components B,, By,...B,, of 
which A is built may well be present, and all other conditions for 
the reaction 


yee ney (1) 


i=1 


may be fulfilled, but reaction (1) will not take place unless a mole- 
cule A is already present. This phenomenon has been sometimes 
likened to autocatalysis. The analogy is, however, a superficial 
one. In autocatalysis, as in any catalysis, the catalyst merely 
speeds up the reaction, which can proceed also in the absence of 
the catalyst though at a much reduced rate. However, to the best 
of our present day knowledge, the reaction of synthesis of a living 
molecule never occurs spontaneously, at any rate, no matter how 
slow. It occurs only through some mediation of an already present 
living molecule. 

The other important feature is that a living molecule does not 
appear to be a static system. It undergoes catabolic processes, 
which seem to be closely connected with reproduction. The purely 
relational aspects of this phenomenon have been discussed by us 
previously (Rashevsky, 1956) and have been used as a basis for 
a topological interpretation of life. But even from a mere physical 
point of view, the same situation holds. Though there may be 
dissent as to whether the Watson-Crick model for the replication 
of the DNA molecule is completely correct, yet it certainly is thus 
far the best model we have. It is strongly supported by the experi- 
mental work of J. H. Taylor and his coworkers (1957) with tagged 
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nucleic acid. [See, however, L. F. La Cour and S. R. Pelc (1958).] 
It calls directly for a close link between catabolism and repro- 
duction. According to that model, a DNA molecule first splits in 
two parts which stand to each other in the relation of a pattern to 
its mold. Then each part rebuilds a full new DNA molecule by re- 
constructing the ‘‘opposite’’ part. In other words, before the DNA 
molecule multiplies, it simply ceases to exist as such. If a gene 
is a DNA molecule or a combination of DNA molecules, then al- 
though the identity and individuality of each gene may be pre- 
served through generations, this preservation of individuality is 
not of a static but of a dynamic nature. The ‘‘mother gene’’ ac- 
tually has ceased to exist before the two ‘‘daughter genes’’ are 
formed. 

In connection with the principle of biological mapping it may be 
appropriate to point out here that the above feature of multiplica- 
tion holds also for other, more complex, organisms. During mito- 
sis the cell nucleus disappears as such and daughter nuclei are 
reconstituted later. To the extent that a complete cell is con- 
sidered as consisting of a nucleus and cytoplasm, it is logically 
correct to say that preceding reproduction, a cell partially dis- 
integrates. In a metazoan the ovum, carrier of the germ plasm, 
disappears as such, after fertilization, until in the adult stage it is 
again reconstituted, to be fertilized and give rise to a new organism. 

The components B, in (1) are in general not individual atoms, 
but molecules of lesser complexity available in the environment. 
From a quantum mechanical point of view the collection of the 


m 
B, component molecules, which we denote symbolically by Dy B,; 

1 
when those components are at rest, is merely a particular quantum 
state of the molecule A, just as a proton and a resting electron at 
infinite distances represent essentially a particular quantum state 
of the hydrogen atom, namely the state characterized by the value 
n = o of the principal quantum number n. (The number of states is, 
however, in the present case finite.) 

It is also characteristic of living matter, that the different bio- 
chemical reactions in the cell proceed not in a homogeneous solu- 
tion, but in a heterogeneous microstructure of a gel-like nature. 
This restricts the thermal motion of the components B,, at least 
just preceding the synthesis, and makes it possible to consider 
2B, as a special quantum state of A. 
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For heteropolar compounds the range of forces between the dif- 
ferent components B, is rather large. If, however, we consider 
homeopolar compounds, then because of the shorter range of the 
homeopolar bonds, not too much of a separation between the B; 
component of the molecule A is necessary to make the molecule 
A effectively ‘‘broken up.’? Thus the components B, may be held 
in appropriate position on the surface of a catalyst or of another 
structure, yet they will not constitute a molecule A. Only a rela- 
tively small displacement of each component B, will make the sys- 
tem into a molecule A. If we keep this in mind, then the concept 
of the ‘‘nonsynthetized’’ and the ‘‘synthetized’’ molecule A as 
being merely two different states of a quantum mechanical system 
does not appear as strange as at first sight. 

The fact that reaction (1) never occurs spontaneously, when 
translated into quantum mechanical terminology, means that the 
transition represented by reaction (1) is a forbidden one. From 
the point of view of the group-theoretical approach to quantum 
mechanics, the forbiddenness of certain transitions is described 
by a rather general theorem. 

Let j be the operator which corresponds to a given scalar quan- 
tity f. Let yf and w,{® denote the wave functions in the states 
a and 6 respectively. Both states may be in general degenerate 
and therefore the indices z and & vary over (different) ranges of 
values. Let an asterisk denote the complex conjugate value. Only 
such transitions of the scalar quantity f are not forbidden, for 
which 


fuer, wv, P>dg # 0, (2) 


where the integral is taken over the whole configurational space. 
But, from the theory of group representation, it follows that the 
integral is different from zero only if y<® and Ada belong to the 
same irreducible representation (Landau and Lifschitz, 1958, p. 
343 ff.). If we have a vector quantity which is transformed by the 
irreducible representation D,, and if D‘™ and D‘) are the irre- 
ducible pentpnantaiions which are induced by the functions y{® 
and vy correspondingly, then only such transitions are allowed 
for Hes) the direct product D“ x D, x D‘® contains the unity 
(totally symmetric) representation. 

Denote the state of the molecule which corresponds to separated 


m 
components ye B,; by S(£B;); denote the state which corresponds 
i 
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to the synthetized molecule A by S(A). Let r,, be the distance be- 


tween the 7th and &th component molecules in "S(3 B,) and consider 
the scalar quantity 


ry 


(3) 


H 
S 
eA 
Al 
oe 
=> 


The quantity r is the average distance between two components. 
It is evident that the value r, of r in the state S(XB.) is different 
from the value = of r in the state S(A). In fact, though this is not 
important, i. > r The reaction (1) corresponds to a transition 


S(2B,) — S(A), (4) 


and also to the transition 


ri T.- (5) 


In general to each energy level there corresponds a different ir- 
reducible representation of the group with respect to which the 
properties of the molecule are invariant. If, however, the energy 
levels which correspond to S(2B,) and S(A) correspond to dif- 
ferent irreducible representations, then the transition (5) is for- 
bidden, and reaction (1) cannot occur. 

It must be noted that the conditions for forbiddenness of transi- 
tions of scalar quantities are different from those for vector quan- 
tities. The latter can and do in general occur between states of 
different energies. In the case of (dipole) radiation we have a tran- 
sition between two values of the vector of the electric dipole mo- 
ment. Scalar quantities will in general change only between states 
of the same energy (Landau and Lifschitz, 1958). 

Any system is of course invariant with respect to the group of 
rotations in space. It is the irreducible representations of this 
group which lead to the classification of the terms from the point 
of view of the values of the angular momentum. But with the as- 
sumed relative immobility of the B,’s and a fortiori of A, the rota- 
tion group is not likely to play a role. 

Physically, the reason for forbiddenness of certain transitions 
lies in the fact that the energy and angular momentum of the sys- 
tem as a whole is each conserved. A transition between two dif- 
ferent energy levels must be accompanied by an emission or ab- 
sorption of a corresponding amount of energy, but the difference of 
the angular momentum in the two states must be equal to the angu- 
lar momentum of the emitted, respectively absorbed, photon; that 
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is, to h/27. That is why transitions between states of different 
energy, which also differ in their angular momentum by more than 
one unit (h/27), are forbidden. 

We consider the components B, as electrically neutral so that 
the emission of radiation during their rearrangement does not oc- 
cur. Therefore, all the above group-theoretical argument boils 
down to the statement that transition (1) can occur only if the 
states A and 2B, have the same (potential) energy. This means 
that the corresponding energy level is at least doubly degenerate. 
But the existence of degenerate levels implies the existence of 
multidimensional irreducible representations of the symmetry group 
of A. If, as seems to be the case with very complicated large 
molecules, A does not possess any symmetry, except of course C,, 
then degenerate levels will never occur, and the energies of the 
states A and 2B, will always be different. 

It is of course conceivable that the states S(2B,) and S(A) 
correspond to the same energy if A possesses even some low sym- 
metry. It is, however, known empirically that the breakdown of 
living matter is an exothermic reaction, and therefore the synthe- 
sis must be endothermic. Hence the energy FE, is greater than the 
energy E'y B;* 

The electronic terms in all the components B, are likely to cor- 
respond to the lowest states, as is the case with most organic 
molecules (Landau and Lifschitz, 1958). It is therefore conceiv- 
able that some of these electronic states become excited in $ (2 B,) 
giving a total energy for S(2B,) equal to E,. Such an excitation 
could be produced by some external factor, and in that case, since 
E’'y,, would be equal to E,, the transition r, _, r,, and therefore 
reaction (1) could proceed spontaneously. Hence, such a case 
would also not correspond to the synthesis of a living molecule. 
For this reason we shall exclude it and consider that we have 
always 


E44 > Exp. (6) 


The breakdown of the molecule A may occur either by way of 
liberating one or more of the components B ; or of aggregates of 
those components, or it may proceed in such a way that some of the 
components B, are themselves broken up. Again, the result of the 
breakdown of A may be considered as a quantum state of A, which 
we shall denote by S(2 A). To it corresponds the energy E ya, < Ey. 
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Now consider a system consisting of two systems SB. and one 
molecule A. Let the systems be sufficiently close to Sach other 
to interact, but sufficiently far from each other, so that the energy 
E, of interaction is very small. Suppose that we have 


E,-Ey,4,=-2(Ey,,~-E,). (7) 


The above considered states S(2B,), S(A), and S(= A;) may in 
general be degenerate, so that to each there corresponds a set WU, 
of eigenfunctions. For simplicity let us first consider the case in 
which neither of those states is degenerate. In that case each 
state is completely characterized by one eigenfunction. Let the 
three eigenfunctions be denoted by w“8), yw“, and yi. Each of 
them is invariant under any transformation of the group. 

The system composed of one A and two 2B, has a total energy 


E,+2Ey,. (8) 


Because of (7), the same energy characterizes a system com- 
posed of two molecules in state S(A) and one in state S(2A,). In 
other words, if we neglect the disturbance due to the interaction 
energy E,,, the whole system has a doubly degenerate energy level, 
given by (8). The eigenfunctions of this energy level are 


[yBi]2 yo? and [yA]? y ld, (9) 
If everything were like this, then the states 
S(A) + 28(2B,) (10) 
and 
S(A,) + 2S(A) (11) 


would correspond to the same doubly degenerate energy level (8), 
and transitions could take place spontaneously between state (10) 
and state (11). But state (10) corresponds to one molecule A, and 
two aggregates 2 B,; state (11) corresponds to one disintegrated 
molecule A, namely to 2 A,, and to two newly formed molecules A. 
Hence, if we had everything that way, the disintegration of the 
present molecule A would cause the formation of two new mole- 
cules A from appropriate components B,. We thus would have a 
process of multiplication of the molecule A, or a process which 
requires the presence of an already existing molecule A as well 
as the destruction of that existing molecule A. 
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However, no matter how small, the disturbance £, will in general 
lift the degeneracy of the energy level, (8). The level will split 
into two levels, so that states (10) will belong to a different, 
though very close, energy level than (11). A transition between 
the states still will not take place. If, however, as we assumed, 
the disturbance E, is very small, then the splitting AF of the energy 
levels will also be very small. The splitting AF is a function 
f(E;) such that 


f(0) = 0. (12) 


The energy levels are not absolutely sharp. They have a width 
dE equal to 


h 
5E = = (13) 
ss 


where / is the Planck constant and T the average life span of the 
state (Landau and Lifschitz, 1958). If £, is so small that 


AE =f) <=, (14) 


then the unsharpness of the energy levels will obliterate the small 
difference AE, and within the limits of any possible physical 
discrimination, the states (10) and (11) will belong to the same 
energy level and the transition 22B, —> 24 will take place simul- 
taneously with the transition A+ 2A;. The mechanism is made 
possible by the unsharpness of the energy levels, which makes the 
degeneration, so to say, ‘‘below threshold.”’ 

In case of transitions between different states it is the state 
with the largest T which determines AE. Therefore, if in the con- 
templated reaction, one of the states is completely stable, having 
T= oo, then AF =0 and the argument does not hold. Hence it is 
necessary to assume that all the states involved are unstable. In 
other words, all the intermediate stages must be unstable. The 
whole process of self-replication must therefore be that of a dy- 
namic flux, and not a succession of stable states. This is biologi- 
cally quite plausible. 

The level width 5£ must, however, satisfy another inequality, 
in addition to (14), to make the described phenomenon observable. 
The phenomenon hinges on the equality (7). If the energy levels 
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were absolutely sharp, the probability of the equality (7) being 
satisfied by chance would be zero. If, however, 5E is small but 
finite, this probability becomes also finite. 

Equality (7) may be written 


Ex 4, = 22 ap, -E,- (15) 


Let EF, and E's, be chosen arbitrarily. Let E be the total range 
of possible values of Ey, . Then the probability that (15) is 
satisfied with an accuracy of 5E equals 

SE (16 

E? 
if there is only one possible level Ey4;- This means that A can 
break up in only one possible way. If A is composed of N atoms 
then the total number of ways in which A can break up is very 
large if N is large. The total number of ways is equal to the num- 
ber of partitions of N. Not all of those ways will, however, be 
physically possible. But let this number be denoted by P(N). 
Then the probability p that for any one of the P(N) breakdowns of 
A, the equality (15) is satisfied within the limits of error dE is: 


P(N) 8E 
2 = 
E 


(17) 


If this probability p is to be of the order of magnitude of 1, we 
must have 


h E 
Te SAAN) 
From (18) we have 
Et 
P(N) ~ Ras (19) 


With plausible values of E and T, we may expect P(N) to be 
very large. The number N will be much smaller than P(N), but 
still it is to be expected that it will be of the order of several 
powers of ten. Thus, if such self-reproducing molecules exist they 
must be found amongst the macromolecules. For small molecules 
the above outlined mechanism of self-reproduction would be almost 
infinitesimally improbable. 
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Basically the same thing will happen if the energy levels E,, 
ESRD and Ey, are degenerate, and each has a set of eigenfunc- 
tions W4), v8), and wfAD, with 7 = 1,2,...,4,;h =1,2,..., 45 
and 1=1,2,...,v,. In that case we obtain instead of a doubly 
degenerate level a level with a v,v,v,-fold degeneration. Due to 
the disturbance E,, it will be split into v,», 1, levels. Again, how- 
ever, if E, is sufficiently small, the splitting will be less than bE. 

The-above model of a self-reproducing molecule is, as we said 
before, discussed in abstracto—as the French say, pour fixer les 
idées. The actual situation with any organism, whether unimolecu- 
lar or not, is different in many respects. One is that the com- 
ponents B, are not as a rule available in a free state, but are parts 
of different other molecules, C, . It is one of the important func- 
tions of the organism to cause a breakdown of the ‘‘food’’ mole- 
cules C’, and thus to make available the different A?s. This cir- 
cumstance throws interesting light on the possibility of interpret- 
ing, in line with the above ideas, the mechanism of replication of 
the DNA molecule as suggested by J. D. Watson and F. H. C. 
Crick (1953). 

The DNA molecule may be schematically represented by the fol- 
lowing structure 


SS EE; 


Liod-vpt bag. (20) 
BiB, — 4 —B. 


Denote the upper complex of B; molecules by A’, that of the B, 
molecules by A. Denote the state of the whole molesila by S(A, A’ ); 
the split molecule A + A’ can again be considered as a particular 
state $(A + A’). Finally the complex A and the dispersed state of 
A’, that is the system A + = BY, as well as the combination A’ + = B5. 
are also states of the Hin ieeuiet which we represent by S(A + = B’) 
and S(A’ + 2 B;). If all those states correspond to different energy 
levels with different irreducible representations, then no transition 
between them can occur. The configurations xB, and = Bi are 
states S(2B,) and S(2B;) of the “‘partial’’ molecules A a A’ 
correspondingly. 

Let 


E,>Exep Eg’ > Exes Ey ae < Egat (21) 
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For the same reason given before, the transition 
SB.) > A; S(2B/) — A’; and S(A, A’) > S(A+ A’) (22) 


will be forbidden. In other words, the synthesis of A and A’ from 
the corresponding B,’s and B;’s will not occur spontaneously. Nor 
will (A, A’) split into A + A’, or be formed from A + A’. 

Let, however, 


By Exe = Eg - Eon. = Eats’ — Es a’ (23) 


If the components A and A’ are sufficiently far removed from each 
other, then we have 


Byjg a? = Ey + Ey (24) 
Equations (24) and (23) give: 
Eye ~ Eye = Eg + Eye - Eg 4’; (25) 
or 
E,+Eype-E, 4’ = 0. (26) 
Again 
BE, +Eyg: =Egsnpy 3 (27) 
therefore, 
E4+aBy =H a,a’- (28) 
Similarly, 
E4’+3B, ty enti: | (29) 


Hence by the same argument stated before, the transitions 
S(A +B) — S(A,4’); S(A° + 2B,) — S(4, 4’) (80) 


can occur spontaneously if (23) holds. In other words either A or 
A’ will rebuild the complete AA’. The third inequality (21) is, 
however, a prerequisite for the whole argument. If B,,,-=E, 4, 
then from (23) it would follow that E,- = Exp, and E, = Ey : 
which would imply the possibility of a ee aitaseons formation of A 
from the B,’s and of A’ from the B;’s. 

Consider now the more realistic case that the B,’s and B;’s are 
not present in a free state but are bound as components of some 
other molecules C, and C7. 
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Let 
C,—> 2B, +R, and C; + 2Bi+ PR; (31) 


represent the breakdowns of C, and C; into B; and B; and some re- 
sidual components. If 


Ec, = Ey B,+R; = Egy a7 - E43 Ec: "7 E's Bl +R; a -E,, (82) 


and if we now consider the states of the system composed of AA’, 
C,, and C;, then the simultaneous splitting of AA’ into A + A’ and 
the breakdown of C, and C; occur without energy change, and the 
corresponding transitions can occur; in other words, the energy of 
the reaction (31) supplies the necessary energy for the splitting of 
AA’. Once AA’ is split into A and A’, each rebuilds a complete 
AA’, 

We thus bring into the system also the part of the environment 
which provides the ‘‘food’’ components for AA’. The molecule AA’ 
cannot split unless simultaneously the food components B, and 
B* are liberated from C; and C7. But unless AA’ splits, no new A 
or A’ and therefore no new AA’ can be formed. This emphasizes 
the ‘‘systemic’’ character of the whole process of reproduction, as 
mentioned earlier on page 311. 

Returning to the first scheme, we may in general consider a 
chain of such reactions. The breakdowns of the living molecule A 
may produce, by the mechanism. discussed above, a molecule A, 
which is different from A. The A, produces A,, and so forth, un- 
til A, produces two A’s. Some of the A,’s may be ordinary cata- 
lysts, some may provide the structural framework within which the 
reactions occur. This framework also eliminates the translation 
energies of the components (p. 315). 

Finally, we may have a situation where the breakdown of a 
molecule A produces components of another living molecule A“); 
the breakdown of A“? produces components of As, such that 
they are ‘‘self-sufficient,’’ and form a ‘‘multimolecular’’ organism. 


The author is indebted to Mr. Robert Rosen for a critical dis- 
cussion of this paper. 


This work has been aided by United States Public Health Serv- 
ice Grant RG-5181. 
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